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1.0  INTRODUCTION 


In  the  past  few  years,  considerable  attention  has 
been  devoted  to  evaluating  and  optimizing  the 
performance  of  commercial  digital  communication 
systems.  Several  studies  have  been  devoted  to  the 
performance  improvement  that  can  be  achieved  through  a 
more  sophisticated  design,  and  the  mathematical  level 
of  these  studies  is  growing  with  the  complexity  of 
these  systems.  Thus,  the  evaluation  of  their 
performance  becomes  an  increasingly  more  demanding 
task . 

The  aim  of  this  report  is  to  introduce  some  new 
techniques  for  the  analysis  and  design  of  digital 
communication  systems.  The  emphasis  here  is  placed  on 
the  computational  algorithms  that  avoid  resorting  to 
simulation  techniques,  which  might  be  either 
prohibitively  expensive,  or  not  completely  satisfactory 
in  terms  of  accuracy. 

Thee  digital  communication  systems  that  can  be 
studied  using  the  techniques  described  in  the  following 
are:  high-rate  transmission  systems  with  limited 
bandwidth;  satellite  and  microwave  radio-relay  links 
with  nonlinearities;  multi-path  and  fading 
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2.0  EVALUATION  OF  ERROR  PROBABILITIES 


In  recent  years,  there  has  been  an  increasing 
demand  for  the  analysis  of  digital  communication 
systems,  partly  due  to  the  availability  of  large 
numbers  of  novel  digital  signal  processing  devices  and 
partly  due  to  certain  attractive  features  of  digital 
systems,  as  for  instance  the  fact  that  digital  signals 
are  more  amenable  to  enciphering  and  deciphering  than 
analog  signals  for  secure  communication  purposes. 
Performance  evaluation  of  these  systems  is  generally 
based  on  the  average  error  probability  criterion. 
However,  as  digital  communication  systems  become  more 
sophisticated,  it  turns  out  that  many  of  the  known 
analytical  and  simulation  techniques  available  for  the 
computation  of  error  probability  are  either  not 
applicable  or  prohibitively  expensive.  This  is  for 
example  the  case  of  digital  communication  systems  such 
as:  systems  operating  on  a  nonlinear  bandlimited 
channel;  satellite  systems  operating  in  a  channelized 
environment  and  a  nonlinear  transponder; 
spread -spectrum  multiple  access  systems;  microwave 
systems  with  cochannel  interferences.  Despite  the 
seemingly  diverse  nature  of  these  systems,  a  unifying 
feature  behind  all  of  them  is  that  at  their  receiver 


output  there  is  a  desired  signal  component  disturbed  by 
an  interference  term  plus  noise.  Let  the  interference 
term  be  denoted  by  X.  Then,  the  error  probability  for 
all  these  systems  can  be  expressed  as  an  expectation  of 
a  suitable  known  function  with  respect  to  the  random 
variable  X.  Usually,  this  expectation  cannot  be 
evaluated  directly,  either  because  the  statistics  of  X 
are  not  known  or  because  its  computation  would  take  too 
long.  Thus,  various  bounding  and  approximation 
techniques  have  been  considered  .[ 1 - 1 9 ] 

Basically,  these  techniques  use  a  set  of  moments 
of  the  random  variable  X.  Ref.  [33]  includes  which  is 
probably  the  most  comprehensive  approach  to  this 
problem.  It  describes  a  class  of  upper  and  lower  error 
bounds  that  can  be  evaluated  with  modest  computational 
effort,  and  can  be  very  tight,  particularly  when  a 
large  number  of  moments  can  be  evaluated  with 
sufficient  accuracy.  These  bounds  can  use  an  arbitrary 
number  of  moments  of  the  random  interference,  and  are 
based  on  Krein-Nudel ’man  theory  [2u-<?  1],  They  provide 
the  tightest  possible  bounds  based  on  a  given  moment 
information,  and  generalize  several  previously  derived 
theories . 
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In  many  situations,  however,  use  of  moment 
bounding  techniques  cannot  be  directly  applied,  either 
because  even  the  momens  cannot  be  evaluated  exactly  or 
because  the  additive  noise  does  not  have  a  known 
probability  density  function.  We  consider  separately 
these  two  situations. 

2.1  Probability  Of  Error  For  Digital  Systems  With 
Inaccurately  Known  Interference 

In  this  section  we  extend  the  moment  bound  theory 
to  the  computation  of  error  probabilities  to  a 
situation  in  which  the  moments  of  the  interference  are 
known  only  within  a  given  accuracy. 

The  model  assumed  for  the  analysis  is  the 
following:  consider  a  digital  communication  system 
where  the  output  decision  random  variable  at  each 
sampling  instant  is  given  by 

(2.1)  R  =  ah  +  X  +  v» , 

where  a  is  the  input  information  symbol  taking  on 
values  in  a  finite  set,  h  is  a  constant,  v>  is  an 
arbitrary  random  noise  with  known  cumulative 
distribution  function,  and  X  is  the  random 
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interference,  which  is  only  known  through  a  set  of 
intervals  in  which  its  moments  lie. 

A  situation  like  this  can  occur  in  several 
instances.  For  example,  the  exact  statistics  of  the 
interference  may  not  be  known,  and  only  estimates  of 
the  first  moments  may  be  available.  Another  example 
arises  when  X  represents  the  intersymbol  interference 
generated  by  a  channel  impulse  response  whose  samples 
are  known  only  in  a  certain  interval.  This  may  occur 
because  those  samples  have  been  measured  with  finite 
accuracy,  or  because  we  want  to  estimate  the  error 
probability  for  a  class  of  channel  impulse  responses. 

The  solution  to  this  problem  can  be  obtained  using 
a  theory  developed  by  Krein  and  Nudel'man  in 
conjunction  with  the  so-called  " Chebyshev-Markov 
problem  with  moments  in  a  parallelepiped" [ 2 1  ] .  This 
theory  deals  with  the  evaluation  of  upper  and  lower 
bounds  of  the  integral 

fk 

U.J)  :  =  G  (  x  )  d  P  (  x  ) 

“'a 

where  G(.)  is  a  known  function,  and  PC.)  is  the 
cumulative  distribution  function  of  a  random  variable 
whose  first  n  moments  are  approximately  known. 


An  outline  of  the  theory  of  moment  bounds  with 
moments  in  a  parallelepiped,  as  well  as  the  derivation 
of  computational  techniques  for  its  solution,  are 
included  in  Appendix  B. 

2.2  Probability  '•!'  hrror  For  Disturbances  Known  Only 
Through  Their  Moments 


Another  case 

of 

interest  arises 

when  a  signal 

of 

known  statistics 

is 

per turbed 

by 

i nter  f er ence 

and 

noise,  both  of 

whom 

are  known 

only  through  their 

moments.  In  this  case  the  situation  is  the  following: 
the  observed  signal  is 

(2  .  K  =  ah  +  X 

where  a  is  a  random  variable  representing  the  useful 
signal  and  whose  probability  density  function  is  known; 
X  is  a  random  variable  representing  the  disturbance, 
and  whose  first  '  n  moments  are  known.  We  want  to 
evaluate  the  probability  that  R  crosses  a  given 
threshold  t,  i.e.,  the  quantity 

G  R ( t )  =  Prob[ R  >  t ] 


(2. 4) 
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are  taken  in  each  waveforms,  2  complex  values  must  be 
stored,  and  1024  256-point  Fourier  transforms  must  be 
evaluated  . 

For  this  reason,  a  particular  attention  has  been 
devoted  to  the  computational  shortcuts  that  can  be 
devised  to  reduce  the  computational  complexity  and/or 
the  storage  requirement  of  the  algorithm.  Symmetries 
arising  in  the  signal  set  have  been  taken  into  account, 
thus  reducing  the  dimensionality  of  the  signal  set  by  a 
factor  of  M  in  most  cases  of  practical  importance. 
Moreover,  the  particular  structure  of  the  matrix  P  has 
been  exploited,  resulting  in  an  iterative  algorithm  for 
the  computation  of  the  matrix  A(f)  in  equation  (3-17). 

*v* 

The  resulting  computer  algorithm  can  compute  the 
power  spectrum  of  a  digital  signal  with  M  up  to  4,  and 
a  memory  L  up  to  5,  in  a  reasonable  time.  An  example 
of  the  results  that  can  be  obtained  by  using  this 
program  is  reported  in  Fig. 2. 


In  conclusion,  the  computation  of  the  power 
spectrum  can  be  performed  through  the  following  steps: 


1.  Determine  the  waveforms  that  are  available  at 
the  channel  output,  and  their  Fourier 
transforms  (use  FFT,  typically).  Arrange  them 
as  the  components  of  the  vectors  O^Cf). 

2.  Compute  the  quantities  c  (f),  c^Cf),  cQ(f), 
and^t(f)  from  eqs  .  (  3  .  1  1 )  -  (  3 . 1 5) 

3.  Compute  the  power  density  spectrum  using 
(3- 1 6) -C  3 . 18) . 

Although  all  these  steps  are  computationally 
straightforward,  Step  1  may  put  a  considerable  burden 
in  terms  of  computer  time  and  storage.  In  fact,  if  L 
denotes  the  channel  memory,  and  M  the  source  alphabet 
size,  i.e.,  the  number  of  different  source  symbols,  the 
number  of  waveforms  to  be  stored  is  M  .  Moreover, 

each  waveform  must  be  represented  with  an  adequate 
number  of  samples,  and  its  Fourier  transform  must  be 
taken.  Thus,  for  example,  if  L=4,  M=^ ,  and  256  samples 
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The  third  quantity  is  the  average  amplitude 


spectrum  of  the  waveforms  available  from  the  channel: 

(3.1*0  J*(f)  =  w 

Finally,  the  fourth  quantity  of  interest  is  the 
average  energy  spectrum  of  the  waveforms  available  at 
the  channel  output: 

(3.15)  ct(n  =  £  PkQ’<ns3^f' 

k.*i 

The  continuous  part  and  the  discrete  part  of  the 
power  density  spectrum  of  the  signal  y(t)  are  then 
given  by: 

(3.16)  Y()‘ 4 [<=•«■>- !/*«>!* ]♦  f  [ctCOACfj  )J 


and 


(3.17)  jl  cf)- z 
3<*  T' 

where 


(3.18)  A(f )  *  £  [  P11"1-  P*] 

A«  1 


we  evaluate  four  quantities  that  play  the  fundamental 
role  in  the  expression  of  the  power  spectral  density  we 
are  looking  for.  The  first  is  the  average  value,  taken 
over  the  source  symbols,  of  the  vectors  Q^(f): 

M 

(3.11)  S2(f>  »  I  pk  9k(f> 

k-i 

The  i-th  component  of  £2(f)  is  then  the  average  Fourier 
transform  of  the  waveforms  available  to  the  channel 
output  when  the  channel  is  in  state  i. 

The  second  quantity  is  the  vector  c^f)  whose  j-th 
component  is  the  average  Fourier  transform  of  the 
waveforms  that,  when  output  by  the  modulator,  force  it 
to  the  state  j .  We  have 

(3.12)  c,(f)  =  £  pkQk(f)DEk, 


where 


(3.13)  D  =  diag  [ w1 , w2, • . . ] 

and  [E^]^  =  1  if  the  source  symbol  takes  the  channel 
from  state  i  to  state  j,  and  zero  elsewhere. 
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We  are  now  in  a  position  to  derive  the  power 
density  spectrum  of  the  digital  signal  (3.1).  Only  the 
final  results  will  be  given,  as  the  details  can  be 
found  in  the  book  [25]. 

Let  us  denote  by 
(3-8)  wt  =  Prob{<Tn  =  i} 

the  stationary  state  probabilities  of  the  Markov  chain, 
i.e.,  the  components  of  the  unique  probability  vector  w 
such  that 

(3.9)  w  P  =  w  . 

fSj  A*  Ml 

Denote  then  by  Q.(f)  the  row  vector  whose  entries  are 
**  k 

the  Fourier  transforms  of  the  waveforms  q(t;.,.), 
according  to  the  following  rule:  the  i-th  component  of 
the  vector  Q^(f)  is  the  transform  of  q(t;A^,i).  That 
is,  Q^(f)  includes  the  Fourier  transforms  of  the 
waveforms  corresponding  to  the  source  symbol  A^,  for 
the  different  channel  states.  Letting  also 


P 


k 


(3.iu) 


Prob{  a  =  A,  } 
n  k 


,  k= 1 , . . . , M , 


As  one  can  see,  from  state  (xyz)  the  shift 
register  can  move  only  to  state  (wxy),  with  probability 
p  if  w  =  o,  and  p  if  w  =  2. 

Consider  then  the  m-step  transition  probabilities. 
These  are  the  elements  of  the  matrix  Pm.  As  the 
channel  state  depends  on  L  symbols,  and  these  are 
assumed  to  be  statistically  independent,  the  state 
CTn+m ,  m>L,  is  independent  of  the  state  C”n  ,  so  that  we 
have 

(3.6)  L 

=■  TT  P  {<»,.*  AjJ  >  miL 

J>i  * 

Thus,  PL  =  PL’VA  =  .  .  .  ,  and  Pu  has  identical  rows.  We 
can  write 


U  00 

(3-7)  P  =  P  , 

which  shows,  in  particular,  that  the  channel  state 
sequence  (0^)  is  a  fully  regular  Markov  chain,  i.e., 
all  eigenvalues  of  the  transition  probability  matrix  P 
having  unit  magnitude  are  identically  1,  and  1  is  a 
simple  eigenvalue. 


p. .  is  given  by 


Pij  "  P{°n*  (^j,  »  —  »  *0  l°n-i  *(Aii  »  *0} 


(  3  •  ^)  *  P  {&n-«  *  Aj^  >•••>  ^n-L*  Aj^  |  8  Aj^  »••• )  &n-w *1  ”  ^iL  } 


*  Plan-t*  Aj,)  Si,j,  Sl,j5  •"  ■ 


where  denotes  the  Kronecker  symbol  <s«  = 1 ,  and 
81j  =o  for  i*  j )  . 

As  an  example,  assume  M=2,  A^sG,  A  si,  and  L=3. 
The  noiseless  channel  has  eight  states,  whose 
lexicographically  ordered  set  is 
l(OGo) ,(GG1) ,(o1u) ,( 111) ,( 1GG) ,(101) ,( 11G) ,(111)} .  The 
transition  probability  matrix  of  the  corresponding 
Markov  chain  is 


(000)  (001)  (010)  (011)  (100)  (101)  (110)  (Ilf) 

Po0  o  O  A0  ool 
poOOOp,000  (001) 
Op#0  OO  p,  O  0  (010) 

OpjOOO^OO  (Oil) 
OOftOOOftO  (100) 
00*000^0  (101) 
ooop^ooopi  01®) 

.00  O  po  0  0  O  P,J  (111) 


p  - 

A* 

(3.5) 
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channel  states,  which  forms  a  Markov  chain.  If  each  of 
the  symbols  an  can  take  on  M  values,  the  states  can 
take  on  M*"  values. 


To  derive  the  transition  matrix  of  the  Markov 
chain  of  the  channel  states,  we  shall  first  introduce  a 
suitable  ordering  for  the  values  of  0^  .  This  can  be 
done  in  a  rather  natural  way  by  firstly  ordering  the 

elements  of  the  set  U  ,Ag . }  of  the  M  values 

taken  on  by  the  an  (a  simple  way  of  doing  that  is  to 
stipulate  that  A^  precedes  Aj  if  and  only  if  i<j),  and 
then  inducing  the  T/J  >wi.ng  "lexicographical"  ordering 

among  the  L-tuples  ( A j  . A j  , . . . , A j  ): 

i  2  L 

(  A  j  A  j  'j  precedes  CV-,Alt) 


(3-3! 


or 


if  and  only  if^ 


j,  *  ii  and  j2  <  *2  >  or 

j,*  »  j*  *  **  »  and  j3  <  i-3  i etc. 


Once  the  state  set  has  been  ordered  according  to 
the  rule  ( 3 • 3 ) ,  each  state  can  be  represented  by  an 
integer  number  expressing  its  position  in  the  ordered 
set.  Thu3,  if  i  represents  the  state 
( A •  ,Ai  , . . . , Ai  ),  the  one-step  transition  probability 
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spectrum.  It  includes  modulator,  filters,  etc.,  and 
transforms  the  d i scr ete- t ime  input  sequence  into  a 
continuous  waveform  y(t).  We  assume  that  the  channel 
has  a  finite  memory,  i.e.,  that  at  time  t  the  channel 
output  y(t)  depends  only  on  a  finite  number,  say  L,  of 
past  source  symbols  besides  the  one  emitted  at  time  t. 
We  can  write 


oo 

(3-D  y(t)  =  £  q(t-nT;an,crn) 

r\»-oo 

where 


(3.2) 


=  ( 


an- 1 ’ an-2 ’ 


n-L' 


is  referred  to  as  to  the  state  of  the  noiseless  channel 
during  the  n-th  time  interval  ((n-1)T,nT).  The 
waveforms  q(t;an,C"n)  take  values  in  a  set  of 
deterministic ,  finite  energy  signals.  The  hypothesis 
that  the  channel  memory  is  finite  justifies  the 
assumption  that  this  set  includes  a  finite  number  of 
different  waveforms. 

The  emission  of  from  the  source  forces  a 

n 

transition  of  the  channel  state  from  <Tn  to  .  A 


sequence  of  source  symbol  thus  generates  a  sequence  of 


An  efficient  computational  technique  to  evaluate 
the  bandwidth  occupancy,  and  more  generally  the  power 
spectral  density  of  a  digital  signal,  was  developed  in 
C  .  This  technique  is  based  on  a  Markov  chain 
model  of  the  digital  signal,  and  is  general  enough  to 
handle  a  large  variety  of  situations:  for  example,  it 
can  include  the  effects  of  convolutional  coding,  linear 
filtering  and  nonlinear  processing  of  the  digital 
signal  . 

The  communication  system  to  which  our  theory 
applies  is  shown  in  Fig.1.  The  source  emits  a 
stationary  sequence  ( an)  of  discrete  independent 


Fiq.1  -  Noiseless  part  of  a  digital  communication 
system . 

random  variables  whose  statistics  are  known.  Each  an 
is  emitted  every  T  seconds.  The  noiseless  channel  is 
assumed  to  include  every  device  between  the  source  and 
the  point  at  which  we  want  to  evaluate  the  power 


The  need  for  modulation  schemes  that  employ 
efficiently  bandwidth  and  power  in  digital  radio 
communication  systems  has  led  to  the  extensive  use  of 
phase-shift  keying  (PSK).  As  power  spectra  of  PSK 
signals  exhibit  sidelobes  which  may  interfere  with 
neighboring  channels,  a  certain  amount  of  filtering  is 
necessary  at  the  transmitter,  to  provide  sidelobes 
removal.  The  nonlinearity  on  the  transponder  will 
however  affect  the  power  spectrum  shape  so  as  to 
restore  the  spectral  sidelobes,  and  this  effect  must  be 
accurately  controlled  in  order  to  avoid  unwanted 
interference  from  and  to  neighboring  channels. 

For  these  reasons,  it  is  useful  to  have  a 
technique  to  compute  the  power  spectrum  spread  produced 
by  nonl i near  1 1 ies  operating  on  digital  signals.  These 
computations  are  usually  performed  by  simulation,  i.e., 
by  applying  a  pseudorandom  sequence  to  a  model  of  the 
system,  and  taking  the  Fourier  transform  of  the  output 
signal.  Such  a  procedure,  besides  being  rather 
t ime-consum i ng  ,  usually  leads  to  results  showing  the 
power  spectrum  in  a  crudely  approximated  form. 


.V  . 
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3 . 0  POWER  SPECTRAL  DENSITY  OF  DIGITAL  SIGNALS 

TRANSMITTED  OVER  NONLINEAR  CHANNELS 

Efficiency  of  use  of  the  radio  spectrum  for  both 
terrestrial  annd  satellite  digital  systems  is  a  subject 
of  increasing  relevance  in  communications,  and  spectrum 
occupancy  is  a  significant  parameter  in  performance 
evaluation.  In  fact,  in  many  situations  data  streams 
from  users  are  assigned  adjacent  frequency  bands  that 
interfere  with  each  other  in  a  larger  or  lesser  extent 
depending  on  the  bandwidth  occupancy  of  the  modulated 
signal.  Thus,  spectrum  occupancy  is  a  rough  measure  of 
adjacent  channel  interference. 

Moreover,  for  spectrum  conservation  the  bandwidth 
occupancy  of  modulated  signals  must  be  kept  to  a 
minimum,  without  impairing  the  system  performance. 
This  is  particularly  relevant  in  the  presence  of 
channel  nonlinearities,  which  may  restore  the  signal 
spectrum  sidelobes  that  were  previously  removed  by 
linear  filtering.  An  important  example  where  this 
situation  occurs  is  provided  by  satellite  transponders 
in  which  a  nonlinear  device  is  present  --  e.g.,  a 
traveling-wave  tube  amplifier,  or  a  hard  limiter. 
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moments . 


The  solution  of  this  problem  can  again  be  found 
from  Krein-Nudel 'man  theory  of  moment  bounds  [21].  If 
n  moments  of  the  random  variable  R  are  known,  it  is 
possible  to  evualuate  upper  and  lower  bounds  to  (2.4) 
that  are  sharp,  i.e.,  no  other  bound  based  on  moments 
can  be  tighter.  Although  the  theory  is  rather 
involved,  the  algorithm  which  provides  the  result  is 
relatively  easy  to  use.  The  algorithm  itself  is  based 
on  the  search  of  the  roots  of  a  polynomial  which 
provide  the  points  of  increase  of  the  distribution 
functions  giving  the  upper  and  lower  bounds  sought. 
Another  algorithm,  which  appears  to  be  computationally 
more  stable,  has  been  obtained  and  is  presently  being 
developed,  and  we  hope  to  report  on  it  soon. 


More  generally,  we  can  assume  that  we  are  dealing 
with  a  random  variable  R  whose  first  n  moments  are 


known,  and  we  want  to  fim:  upper  and  lower  bounds  to 
the  quantity  (^.4)  for  any  given  t. 

This  problem  arises  in  several  instances.  For 
example,  we  may  want  to  determine  the  cumulative 
distribution  function  of  a  random  variable  made  up  as 
the  sum  of  a  number  of  independent  random  variables. 
The  exact  distribution  function,  apart  from  a  few 
special  cases,  is  difficult  to  determine,  but  as  the 
moments  of  a  sum  of  random  variables  are  relatively 
easy  to  compute,  we  can  use  this  theory  to  find  upper 
and  lower  bounds  to  the  distribution  function.  This 
works  better  that  the  assumption,  which  is  often  made 
in  this  situation,  that  the  random  variable  we  are 
dealing  with  are  Gaussian  --  this  may  make  sense  when 
the  number  of  random  variables  involved  is  very  large, 
but  has  still  to  be  verified.  Yet  another  application 
arises  when  a  digital  communication  system  is  perturbed 
by  a  noise  whose  exact  statistics  are  not  known. 
Sometimes  the  assumption  of  a  given  distribution  whose 
parameters  fit  those  measured  will  work,  but  it  may  be 
preferable  to  use  the  theory  presented  here,  which 
requires  no  hypotheses  to  be  added  to  the  knowledge  of 
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POWER  SPECTRUM  IDS) 


0 


0  1  2  3  4  5 

FREQUENCY,' BIT  RATE 


Fiq.2  -  Power  density  spectrum  of  a  binary  PSK  siqnal. 
a-  Before  filterinq 

b-  After  filterinq  (filter  is  a  4-pole  Butterworth 
with  a  3-dB  bandwidth  2.4  R,  R  the  bit  rate 
c-  After  a  TWT  driven  at  saturation 
d-  After  the  same  TWT,  with  a  6-dB  input  backoff 
e-  After  the  same  TWT,  with  a  12-dB  input  backoff. 
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This  figure  shows  the  power  density  spectrum  of  a 
binary  PSK  signal  first  filtered  through  a  fourth-order 
Butterworth  filter  ,  and  then  passed  through  a 
nonlinear  amplifier  driven  at  or  near  saturation.  It 
is  seen  that,  although  filtering  removes  the  sidelobes, 
they  are  restored  by  the  nonlinearity,  to  an  amount 
that  can  be  evaluated  with  a  good  accuracy. 


Other  avenues  of  attack  have  been  explored  as 
well.  One  which  at  first  seemed  particularly  promising 
is  the  use  of  Volterra  series  to  model  the  nonlinear 
channel.  This  approach  proved  to  be  very  fruitful  in 
the  context  of  evaluation  of  error  probabilities,  as 
explored  in  [26-27].  However,  Volterra  series  do  not 
appear  to  offer  any  particular  advantage  when  the 
nonlinear  channel  has  to  be  modeled  in  order  to  derive 
the  power  density  spectrum.  Actually,  our  practice  has 
shown  that  when  the  channel  is  made  up  by  cascading  a 
number  of  blocks  (typically,  filters  and  nonlinear 
memoryless  devices)  the  best  approach  to  derive  the 
waveforms  at  its  output  is  simulation.  After  this 
preliminary  simulation  has  been  performed,  the 
analytical  tools  previously  described  can  be  used  to 
compute  the  power  spectrum. 


OVER  NONLINEAR  CHANNELS 

A  problem  arising  in  digital  transmission  over 
nonlinear  channels  is  the  design  of  modem  filters. 
Filters  incorporated  in  present-day  modems  are  not 
necessarily  optimum  when  the  channel  is  nonlinear,  as 
they  are  usually  designed  with  the  Nyquist  theory  in 
mind.  The  early  INTELSAT  4-PSK  TDMA  modems  specified 
Nyquist  filtering,  with  a  30-percent  rolloff  Nyquist 
transmitting  filter  combined  with  an  f/sin  f  aperture 
equalizer.  The  receiving  filter  was  a  high-order 
elliptic  filter,  whose  bandwidth  was  selected  30  as  to 
maximize  the  noise  rejection  while  not  degrading  the 
Nyquist  response  of  the  transmitting  filter  [28]. 
Recent  research  is  aimed  at  selecting  optimum  filters 
for  a  nonlinear  channel  model,  either  by  choosing  the 
best  values  of  the  parameters  within  a  given  filter 
family  [29]  or  by  designing  the  frequency 
characteristics  of  a  filter  which  is  optimum  under  a 
specified  criterion  [30]. 

By  using  the  channel  model  described  in  the 
previous  Section  of  this  Report,  and  which  is  based  on 
a  set  of  output  waveforms  connected  to  form  a  Markov 
chain,  we  developed  a  theory  to  design  an  optimum 


26 
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receiving 

filter 

for  digital  transmission 

over  a 

nonlinear 

channel 

.  The  filter  is  chosen 

so  as  to 

*-  * j 

\  *'J 

4 

minimize 

the  mean- 

square  error  between  the  transmitted 

•  3 

• 

symbols 

and  the 

samples  of  the  demodulated 

waveform . 

Besides 

providing 

closed-form  expressions 

for  such 

optimal  filters,  it  was  shown  that  the  structure  of  the  • 


filter  corresponds  to  a  bank  of  matched  filters,  each 
followed  by  a  linear  transversal  filter.  This  result 

t 

is  a  generalization  of  a  well-known  property  of  optimum  • 

receiving  filters  for  linear  channels.  The  performance 
of  the  optimum  receivers  can  be  computed,  and  it3 
performance  compared  against  that  obtained  through  '  • 

conventional  designs. 

Some  results  obtained  are  reported  in  Appendix  C. 

It  must  be  noticed  that  the  full  generality  allowed  by 
the  theory  developed  there  has  not  been  exploited  in 
full.  We  hope  to  report  soon  on  this  topic,  by 
preparing  a  comprehensive  paper  covering  also  the 
connection  to  previously  known  theories,  and  showing 
the  applicability  of  our  theory  to  a  situation  in  which 
the  channel  disturbances  include  also  interference  from 
adjacent  channels. 
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applications  of  error-correct ing  codes,  extra  bits  must 
be  added  to  the  transmitted  symbol  sequence,  with  the 
modulator  operating  at  a  higher  rate,  and  hence 
requiring  a  larger  bandwidth.  On  the  other  hand,  use 
of  coding  can  decrease  the  power  requirement  necessary 
to  achieve  a  given  performance,  so  that  this  gain  in 
performance  (usually  referred  to  as  the  "coding  gain") 
can  compensate  for  the  sacrificed  bandwidth.  In  other 
words,  the  tradeoff  is  increased  complexity  and 
decreased  bandwidth  efficiency  for  increased  power 
efficiency.  With  Ungerboeck  codes,  one  can  avoid  the 
loss  in  bandwidth  efficiency  by  using  a  technique 
characterized  by  the  fact  that  the  redundancy  required 
by  the  coding  process  is  provided  by  increasing  the 
number  of  coded  symbols  instead  of  the  bandwidth.  With 
these  codes,  the  tradeoff  is  increased  complexity  for 
increased  power  efficiency. 

Some  of  the  results  we  obtained  in  the  study  of 
this  class  of  codes  are  included  in  Appendix  D  and 
Appendix  E.  In  particular,  we  considered  a  combination 
of  multidimensional  signal  sets  and  Ungerboeck  codes, 
an  idea  which  seems  to  be  very  promising  in  terms  of 
applications.  We  introduced  a  new  class  of  signal  sets 
for  modulation,  that  we  call  "generalized  group  codes" 
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and  which  are  based  on  a  peak-energy  constraint.  These 
alphabets  have  a  good  deal  of  symmetry,  a  feature  which 
is  apparently  necessary  to  design  good  Ungerboeck 
codes.  Moreover,  design  techniques  are  derived,  based 
on  a  partition  of  the  signal  set  stemming  from  the 
partition  of  a  group  into  cosets  of  a  convenient 
subgroup  (Appendix  D) . 

We  have  also  investigated  the  power  spectral 
density  properties  of  the  signals  obtained  from  an 
Ungerboeck  code.  In  particular,  we  have  shown 
rigorously  that,  under  certain  mild  symmetry 
contraints,  Ungerboeck  codes  do  not  alter  the  power 
spectrum  of  the  modulated  signals  (Appendix  E)  . 
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APPENDIX 


In  this  Append  i  «  up  compute  the  probabilit  !>*!'•  E'*.£t  =  J  I  -  *  !  Per 

the  situation  'Jescibed  in  Section  1  .  We  assume  that  the  random 
variables  in  the  sequence  (an)  “f  data  symbols  are  independent,  and 
-ake  on  values  in  the  set  A  -  { 0to,  oi . with  probabilities 

r,  ;  p  f  a  v  a.  )  .  It  an  be  seen  that  the  channel  suites  form  a 

k  "  W 

homogeneous  Markov  chain.  The  one-to-one  correspondence  between 

channel  states  and  the  integers  can  be  set  by  ordering  the 

elements  of  A  (for  ex  ampl  e  ,  0io  <  <x,  < .  .  .  <  <XM_, )  and  introducing  among  the 
i.+ i  ) -tuples  Ian,a^  )  the  lexicographical  order  induced  by 

’he  order  of  A.  The  states  can  then  be  represented  by  Integer  numbers 
from  i  to  M  by  expressing  their  position  in  the  ordered  set.  lhus,  if 
-epresents  the  state  (<x  .a.  ,  .  .  .  ,oc.  1  and  i  the  state 

O  ^ 

(M.  ,  CX;  ,  a,  )  ,  we  have 

Jo  Ji  Jl 


P  1  ,  =  l  It  . 

i  ♦  1  y  9 

•'  IT 

■i, . 

=  P  •  a  ,  -»a  .  ,  a 

Jo  ■ 

•“‘3, - •aj-L+r1; 

*  p .  •  1  • 

J  1  1  1  1  , 

'O  O'  1  1 

j  2  '  1.- 1  j  L 
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Assume  that  we  want  to  evaluate  the  power  spectrum  of  the  signal 
x(t)  at  the  output  of  the  channel  of  Fig.1.  If  0(f)  denotes  the  veo- 

NV 

tor  of  Fourier  transforms  of  the  waveforms  (q(t;i)}  ,  and  the  symbols 

ar  are  independent  and  Identically  distributed,  with  the  simplifying 
assumpt ion 

M 

l  q(t;i )  =  0 
i-.l 

we  have  the  following  result  for  the  power  spectrum  of  x(t)i 


0  '  <•  i 

x 


where  Re(.)  denotes  the  real  part,  the  dagger  denotes  conjugate 
transpose  , 


Av(f)  =  l 


i  ^ fT  *  *  r 


-jl.nfT 


t*T 

e 


and  P  is  the  M*M  matrix  whose  elements  are  the  probabilities 


( see  the  Append  lx). 


6.0  CONCLUSIONS 

We  have  described  a  number  of  parameters  (error  probability, 
minimum  distance,  cutoff  rate,  spectrum  occupancy)  that  are  useful  to 
evaluate  the  performance  of  a  digital  communication  system.  A  rather 
general  system  model  has  been  assumed,  which  is  easily  amenable  to 
analysis  and  is  based  on  the  hypothesis  of  a  finite  memory  for  the 
rran3mis3ion  channel.  Based  on  this  model,  we  have  described  the 
' omputational  techniques  that  can  be  applied  for  evaluating  those 
parameters. [ 14) 
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is  described  in  [10]. 


4.0  COMPUTATION  OF  THE  CUTOtk  RATE 

Consider  now  a  digital  communication  system  in  which  a  code  has 
to  be  used.  To  evaluate  the  coding  capabilities  of  the  channel  ,  a 
reasonable  criterion  would  be  to  choose  the  best  code  for  that 
channel,  and  the  corresponding  error  probability.  But  this  cannot  be 
done  in  general,  because  the  selection  of  the  best  code  is  hardly 
feasible  in  the  practice.  Now,  information  theory  provides  us  with  a 
osult  concerning  the  error  probability  that  can  be  achieved  over  a 
g-ven  channel  as  a  function  of  the  transmission  rate.  The  result  is 
,.lre  following  :  there  exists  a  block  code  of  •-ate  R  and  length  n  such 
that  the  error  probability  is  upper  bounded  by 

id.’,1  £  ,  v  '•*'  P  •  h  _  (4.1) 

where  c  is  a  constant,  and  RQ  is  a  parameter  depending  on  the  channel, 
and  called  its  cutoff  rat“.  This  relation  p1 aces  in  evidence  the  way 
at  which  the  attainable  er>or  probability  decays  as  a  function  of  the 
block  length  of  the  code.  The  larger  RQ ,  the  better  the  channel. 
Hence,  R„  provides  a  useful  one-parameter  char ac ter i za t ion  of  the 
quality  of  the  channel  in  terms  of  its  coding  capabilities  ! 1 1] . 

An  expression  for  the  cutoff  rate  of  the  channel  of  Fig.1  has 

’’een  derived  in  [1?]  (see  also  (13)  for  similar  computations  when  a 

•  on vol u t lonal  code  is  used'.  The  following  result  holds: 

"  1  ■">'  1  14.t’) 

where  ^  is  the  unique  real  and  positive  eigenva'ue  of  the  M* 

matrix  whose  elements  are 
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(3.1)  it  is  seen  that  this  is  the  relevant  parameter  for  assessing  the 
performance  of  a  digital  system  equipped  with  a  max  imum- 1 i ke 1 i hood 
sequence  detector. 

Unfortunately,  the  minimum  distance  does  not  3eem  to  be 
expressible  in  a  closed  form.  Moreover,  its  direct  computation  based 
on  (3.2)  is  impractical  in  most  cases  of  interest.  Thus,  for  the 
evaluation  of  dmiR  one  must  resort  to  algorithms  suitably  designed. 
One  such  algorithm  has  been  applied  in  [8]  to  the  transmission  over 
linear  channels.  It  is  based  on  a  computer-search  approach,  and 
results  into  a  small  "sufficient”  set  of  sequence  pairs  (£„),(£') 
differing  in  few  positions.  The  minimum 
of  ||  Z  C q(  t-nT ;  )-q ( t-nT;  |^ )  ]  j|  taken  in  this  set  provides  the 
minimum  distance. 


The  computation  of  dmjn  for  a  nonlinear  channel  has  been  studied 

in  ^9],  under  the  simplifying  hypothesis  that  only  symbol  sequences 

differing  in  just  one  symbol  contribute  to  the  value  of  the  minimum 

disf.-nce.  Consider  for  simplicity  binary  symbols,  and  L+1  consecutive 

channel  states  3  -(£  ,  t  . $  ).  Take  another  L*1-tuple 

3n  -'n+i 

2.  *  (  •  •  •  » >  with  differing  from  |n  only  in  the  first 

position,  E'  differing  from  .  only  in  the  second  position. 

from  in  the  third,  etc.  Compute  then 


,  I- 

-  '  =  V 


for  all  possible  pairs  21  (f°r  uncoded  binary  symbols,  there  are 
2  ^  such  pairs)  and  take  the  minimum  value  found.  This  is  the 
nr  n l mum  distance.  Although  this  method  needs  the  manipulation  of  a 
'  ge  number  of  sequences,  the  numerical  operations  required  are  quite 
simple. 

Yet  another  technique  for  the  computation  of  the  minimum  distance 


l 


same  set  of  moments  can  be  tighter. 

Moment  bound  techniques  are  described  in  [33  for  linear  channels, 
and  in  t  1 3 ,  [  2  I ,  [  4  ]  for  nonlinear  channels.  In  l  v.  1  the  case  of 
correlated  random  variables  an  is  considered. 


3.0  COMPUTATION  OF  THE  MINIMUM  DISTANCE 

In  the  previous  section  we  have  considered  the  eTor  probability 
of  a  3imple  receiver  whose  decisions  are  taken  symbol -by-symbol  .  In 
certain  systems,  however,  it  may  prove  highly  beneficial  to  include  a 
more  sophisticated  receiver  to  achieve  nearly  ideal  performance.  This 
can  be  done  by  using  a  maximum  1 lkel thood  receiver ,  in  which  the 
decisions  on  a  data  sequence  are  taken  by  considering  all  the  possible 
transmitted  sequences  and  choosing  the  one  most  likely  on  the  basis  of 
the  observed  signal  y(t).  The  operation  of  such  a  receiver  is 
analyzed  in  r6]  for  linear  channels,  and  in  [7]  for  a  nonlinear 
satellite  channel. 


When  tht3  receiver  is  used,  the  symbol  error  probability  is 
closely  approximated,  for  a  small  noise  power,  by 


P(e  i 


K  erf o  ( 


(5-1) 


where  K  is  a  constant,  N0/2  is  the  two-sided  power  spectral  density  of 
the  noise  v ' 1  ,  and  d  •_  is  the  minimum  distance,  defined  as 

y'  =  min  !!  £{'l(  ’-nTjE'  )  -  q(t-i,T;C  ))  !| 1  2) 

ri  r, 

with  |  z(  t  denot  lng  the  energy  of  the  signal  z(t).  In  words,  dmin  is 


the  smallest  possible  Euclidean  distance  attainable  between  received 
signals  stemming  from  different  trensmltted  symbol  sequences.  From 
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Pf  •  ;  =  l  Bferfc  fliS-*  — ]  }  . 

A  ->v 

If  the  channel  is  not  linear,  q(t0;gn)  depends  on  a^ .  an.1  >  •  •  •  >  an.L 
In  a  nonlinear  way.  A  useful  representation  of  thi3  dependence  is 
provided  by  expanding  the  functional  relationship  between  q(t0;§n)  and 

an,a„_ . 'an-L  a  Volterra  series ,  which  will  be  truncated  to  a 

finite  number  of  terms.  This  repr esentation  is  discussed  in  [1]  for 
baseband  channels,  and  in  [2]  for  passband  channels. 

Consider  now  the  problem  of  evaluating  the  averages  appearing  in 
(2.2).  In  the  most  general  case,  we  are  faced  with  the  computation  of 
an  average  of  the  form  £[  f  ( a(|_1 ,  an_4  ,  •  •  .  ,  a„_L  )  ] ,  where  f(.)  is  a  known 

function  of  the  L  random  variables  . . an-u  •  In  principle,  we 

can  compute  this  average  by  enumerating  all  the  possible  values  taken 
on  by  !'(.),  but  this  technique  is  computationally  impractical  whenever 
the  number  of  these  values  is  very  large.  As  an  example,  If  the 
random  variables  an  are  independent  and  take  on  N  values,  there  are  NL 
possible  values  for  f(.).  For  instance,  N=8  and  L=1Q  would  lead  to 

O 

about  10  computations  of  the  function  f(,). 

A  method  that  yields  both  accurate  and  computationally  tractable 
results  for  the  computation  of  the  average  required  is  based  upon  the 
theory  of  moment  bounds.  Basically,  it  consists  in  the  derivation  of 
upper  and  lower  bounds  to  E[ f ( an  ,  .  .  .  , an_L  )  ]  based  on  the  exact 
evaluation  of  a  number  of  averages  like  E[ p( an an_L ) J ,  where 
p(.)  i?  a  polynomial  function.  When  the  random  variables  an  are 
statistically  independent,  the  bounds  can  be  expressed  in  terms  of  the 
moment  s 

u  .  «  F;(  a k  )  !  ? .  o } 

r.  n  ' 

"ese  bounds  are  optimum,  in  the  sense  that  no  other  bounds  using  the 


-  4 
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P{e)  =  P{a  ^  a  } 
n  n 

3  l  Pf.v(  t  )  >  0  la  =-.M  +  P{y(  t  )  <  0  |  a  “+1 ) 
n  1  n  n  1  n 

<P.O 

=  '  l'!q(t  ;£  )+v  >C^a  -*  •  1  +  2  P{q(t  ;£  J+v^Oia  *♦:' 
on  n  o  n  1  n 


where  v>(  t0  +  nT )  is  a  Gaussian  random  variable  with  zero  mear  and 
known  variance  <7*  .  From  (2.1)  we  can  write 


f’(e)  =  t  K(erfc  fH-iiiiiuill  [a  —1} 
4  '/to  J  11 

V 

+  T  Klert'c  ' —j..  | a  *+1} 

‘  v1?"  u  ‘  n 


where  E  denotes  average  taken  with  respect  to  the  random  variable  . 

The  following  tasks  must  be  accomplished  for  the  computation  of 
P(e'.  Firstly,  we  have  to  write  down  an  explicit  expression  for 
Secondly,  we  must  compute  the  averages  in  (2.2). 


Suppose  first  that  the  channel  Is  linear.  Then  q(t„;|n)  depends 
linearly  on  the  random  variables  an,aM,  .  ..  ,aB.L,  so  we  can  write 


q(W  “  A 

l'U 


li  .a 
i  n-i 


where  hQ,h<  . hL  are  the  samples  of  the  impulse  response  of  the 

channel  .  Hence 


<)U  -,t  )  * 

o  'n 


ha  +  " 


o  n 


(2.4) 


where 


!i ,  a 


i  n  -  i 


(2-h) 


is  called  the  Intersymbol  Interference  term.  Observing  that  Z  Is  a 
symmetric  random  variable,  i.e.,  Z  and  -Z  are  equally  distributed,  we 

hav»  r rom  (2.2) : 
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where  lq(t;i)}M  is  the  set  of  waveforms  (of  duration  T)  observable  at 
the  output  of  the  noiseless  channel  in  the  interval  (G,T1.  The 
sequence  ( g  —  is  made  up  of  random  variables,  called  the  channel 
states ,  taking  on  M  possible  values.  With  our  assumption  of  a  finite 
memory  for  the  noiseless  channel,  each  depends  on  a  finite  number, 
say  L+1,  of  input  data  symbols  an  ,  so  that  we  can  set  a  one-to-one 
correspondence  among  the  and  the  (L+15-tuples  ( a„  i  a„_, .  •  ■  •  ,  an-u  '  • 

This  correspondence  allows  one  to  determine  the  statistics  of  the 
sequence  ( gn) ,  and  hence  of  x(t),  when  the  statistics  of  the  data 
sequence  (an)  are  completely  known.  We  assume  here  that  (a„)  is 
stationary. 

The  signal  observed  at  the  receiver  front-end  is,  from  Fig.1, 

y(t,)  =  x(t  1  +  vlt)  '  \  .2 

where  v(t)  is  assumed  to  be  a  white  Gaussian  noise  process. 


?.U  COMPUTATION  OF  ERROR  PROBABILITY 

The  error  probability  is  a  basic  measure  of  the  performance  of 
digital  communication  systems.  In  this  section  we  consider  a 
situation  in  which  the  receiver  operates  by  sampling  the  received 
signal  v(t)  every  T  sec. The  decision  on  the  n-th  transmitted  symbol 
is  based  only  on  the  value  of  this  sample.  For  simplicity's  sake,  we 
shall  confine  ourselves  to  consideration  of  the  case  in  which  the 
input  data  symbols  take  on  with  equal  probabilities  the  values  +1,  and 
modulate  linearly  a  given  waveform  (this  situation  corresponds  to 
binary  PAM,  or  PSK > .  The  receiver  compares  the  received  sample  with  a 
zer.  threshold,  and  sets  a  **1  if  it  lies  above  the  threshold,  a  =-1 
othe'-w’se.  For  a  sample  taken  at  time  t_=  t  +nT  ,  the  symbol  error 

n  O 

probability  i3  given  by 


the  evaluation  of  their  performances  becomes  more  demanding.  For 
example,  In  satellite  communications  the  efficient  use  of  available 
signal  power  and  bandwidth  makes  them  to  operate  on  a  tightly 
bandlimited  nonlinear  channel,  where  the  computation  of  error  rates 
(say)  Is  not  analytically  tractable. 

The  aim  of  this  paper  is  to  review  some  of  the  techniques  that 
have  i  3  recently  proposed  for  evaluating  the  performance  of  digital 
communict  ion  systems.  The  emphasis  Is  placed  here  on  the 
computational  algorithms  that  allow  this  evaluation  to  be  performed 
without  resorting  to  simulation  techniques,  which  might  be  either 
prohibitively  expensive  or  not  completely  satisfactory  in  terms  of 
accurac y  . 

We  are  Interested  In  considering  digital  communication  systems 
such  as:  high  data-rate  transmission  systems  with  limited  bandwidth; 
satellite  and  microwave  radio-relay  links  with  non  1 1  near i tl es ; 
multi-path  and  fading  transmission  systems.  The  common  feature  of  all 
these  systems  Is  that  they  can  often  be  modeled  as  follows.  At  the 
receiver  front-end  of  the  system  there  is  a  desired  signal  component 
disturbed  by  an  additive  noise  term.  Under  the  assumption  that  the 
noiseless  portion  of  the  channel  has  a  finite  memory  (i.e.,  the 
received  signal  at  any  given  time  instant  does  not  depend  on  the 
Infinite  past  of  the  transmitted  signal)  the  number  of  waveforms  that 
can  be  observed,  before  the  addition  of  noise,  In  any  time  Interval  of 
finite  duration  is  finite.  With  reference  to  Fig.1,  if  T  denotes  the 
inverse  of  the  symbol  rate,  l.e.,the  time  interval  between  the 
emission  of  two  consecutive  symbols,  the  signal  x(t)  can  be 
represented  In  the  form 
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SUMMARY 

In  this  paper  we  review  some  techniques  for 
evaluating  the  performance  of  digital 
communication  systems  operating  on  channels 
characterized  by  additive  Gaussian  noise  as  well 
as  linear  and  nonlinear  distortions.  The 
parameters  considered  are  the  error  probability, 
the  minimum  distance  (useful  when  a 
maximum-likelihood  sequence  receiver  is  used),  the 
cutoff  rate  (useful  when  coding  has  to  be  used  on 
the  channel),  and  the  spectral  occupancy  (useful 
when  two  or  more  users  share  the  same  frequency 
band).  The  emphasis  is  placed  on  the 
computational  algorithms  that  allow  these 
parameters  to  be  evaluated  numerically. 


1.0  INTRODUCTION  AND  MOTIVATION  OF  THE  WORK 

In  recent  years,  there  has  been  an  Increasing  interest  in  digital 
communication  systems.  This  is  partially  due  to  the  availability  of 
novel  digital  signal  processing  devices  as  well  as  to  certain 
attractive  features  of  digital  systems,  as  for  instance  the  fact  that 
digital  signals  are  more  amenable  to  enciphering  and  deciphering  than 
analog  signals  for  secure  communication  purposes.  On  the  other  hand, 
the  ever-increasing  demand  for  digital  services  has  suggested  the 
Introduction  of  more  efficient,  and  hence  more  sophisticated, 
communication  systems.  As  the  complexity  of  these  systems  . vases, 
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Probability  of  Error  for  Digital  Systems  with 
Inaccurately  Known  Interference 

h/lO  M  BIGLIHRI.  senior  member,  ieie 

Abstract  — The  “moment  bound  theory”  >»  known  to  provide  a  uveful 
technique  to  evaluate  error  probabilities  tor  digital  communication  systems 
in  the  presence  ot  additive  noise  and  random  interference.  In  this  corre¬ 
spondence  this  theory  is  estended  to  the  case  where  the  moments  of  the 
interference  are  known  only  within  certain  intervalv  and  upper  and  lower 
bounds  to  the  error  probabilities  are  sought.  A  situation  like  this  can  occur 
in  several  applications  ior  example,  the  esacl  statistics  of  the  interference 
may  not  be  known,  and  only  estimates  of  the  first  moments  may  be 
available  Another  example  arises  when  the  signal  is  disturbed  by  intersyin- 
bol  interference  generated  bv  a  ihannel  impulse  response  whose  samples 
are  known  only  in  a  certain  interval  —  either  because  they  have  been 
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measured  with  finite  accuracy,  or  because  we  want  to  estimate  the  error 
probability  for  a  class  of  channel  impulse  responses.  Several  numerical 
examples  are  provided  which  show  the  range  of  appficabi.’.t.  of  this 
technique. 

1  Introduction 

In  recent  years,  several  techniques  have  been  proposed  to 
evaluate  error  probabilities  for  digital  communication  systems  in 
the  presence  of  additive  noise  and  random  interference.  Among 
these  techniques,  multidimensional  moment  bounds  [1]  appear  to 
be  most  useful,  because  they  are  generally  very  tight  and  can  be 
evaluated  with  modest  computational  effort  Moreover,  no  other 
bounds  ba  cd  only  on  the  moments  of  the  random  interference 
can  be  tighter. 

In  this  correspondence  we  extend  the  moment  bound  theory  to 
the  computation  of  error  probabilities  in  a  situation  in  which  the 
moments  of  the  interference  are  known  only  in  certain  intervals. 
The  model  assumed  for  the  analysis  is  the  following:  consider  a 
digital  communication  system  where  the  output  decision  random 
variable  at  each  sampling  instant  is  given  by 

R  —  ah  +  Z  +  * ,  (1.1) 

where  a  is  the  input  information  symbol  taking  on  values  in  a 
finite  set,  h  is  the  known  positive  peak  overall  sampled  system 
response,  r  is  an  arbitrary  random  noise  with  known  cumulative 
distribution  function  F,{  ),  and  Z  is  the  random  interference, 
typically  modeled  as  a  sum  of  independent  random  variables.  We 
assume  that  the  random  variables  a,  Z,  and  r  are  mutually 
independent,  and  that  Z  has  a  finite  range  [-£>,  £>).  Evaluation 
of  the  error  probability  of  the  above  system  can  be  based  on 
moment  bound  theory  by  first  expressing  it  as  the  average 

Pt-Ey[n(Y)).  (1.2) 

where  Y  -  /(Z)  is  a  random  variable  with  finite  range  (u,  h], 
O(-),  /(  )  are  known  functions,  and  £y  denotes  average  taken 
with  respect  to  the  random  variable  Y.  Then,  if  the  finite  se¬ 
quence  of  moments  {  c, }  7_ ,  is  available,  where 

c,  *£[>"],  i-1, ■■■,».,  (1.3) 

the  moment  bound  theory,  as  developed  by  Krein  (2),  provides 
the  lightest  upper  and  lower  bounds  to  £V[S2(K)|  in  the  form 

L  w/n(y/)  s  £y[om]  *  L  <a{yr).  0-4) 

i-i  i-i 

where  the  abscissas  {y,'),v-i.  ( y,"}?-\  an^  the  weights  ( w/ 
(w’,"}1AL'1  can  be  computed  on  the  basis  of  the  moment  set 
(c, The  values  of  N‘  and  N”,  as  well  as  the  rules  for 
computing  weights  and  abscissas  in  (1.4),  vary  according  to  the 
parity  of  n  and  to  the  sign  of  the  (n  +  3)rd  derivative  of  the 
function  Q  in  the  interval  [a,  h).  Details  on  the  actual  evaluation 
of  moment  bounds  can  be  found  in  [1|.  together  with  a  number  of 
generalizations  of  the  model  considered  here 

As  an  example,  if  a  takes  on  values  +  1  with  equal  probabih 
tics,  and  r  is  a  zero-mean  Gaussian  random  variable  with  vari¬ 
ance  a ‘ ,  the  error  probability  in  (1.2)  lakes  the  form 


where 

0(a)  ”  (2")  '  7"CXP(  -r'/2)  Jl 
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Fij.  1  Locus  n  of  vectors  r  such  that  r '  *  c  «s  c". 


SNR(dB) 


Hence,  we  can  write 

(1.6) 

This  is  equivalent  to  the  choice 

Y-h  +  Z 

(1.7) 

and  Q(x)  -  Q(x/o)  in  (1.2).  However,  this 
only  available  choice.  In  fact,  if  we  take 

is  by  no  means  the 

Y  -  (h  +  Z)\ 

(1.8) 

ihc  error  probability  car  be  expressed  as 


cause  we  want  to  estimate  the  error  probability  for  a  class  of 
channel  impulse  responses. 

The  solution  to  this  problem  can  be  obtained  by  using  a  theory 
developed  by  Krein  in  conjunction  with  the  so-called 
“CebvSev- Markov  problem  with  moments  in  a  parallelepiped" 
[2],  This  theory  deals  with  the  evaluation  of  upper  and  lower 
bounds  of  the  integral 

l-  (hG(x)<lP(x). 

J  a 

where  G(  )  is  a  known  function,  and  P()  is  the  cumulative 
distribution  function  of  a  random  variable  whose  first  n  moments 
are  specified  in  the  form  (1.10). 


P.~ 


(1.9) 


which  is  valid  provided  that  h  -  D  z  0  (the  “open-eye”  assump¬ 
tion).  In  this  case  I2(  x )  -  Q(Jx  /a ). 

In  this  correspondence  we  shall  consider  the  situation  in  which 
the  moments  (c, )“_ ,  are  not  available  exactly,  but  it  is  known 
that  they  he  in  finite  intervals  In  other  words,  two  sets  { t, ) T-  i 
and  {  f, )".  |  are  known  such  that 

f,  S  r,  £  (, .  t  -  I .  .n.  (110) 


and  we  want  to  find  upper  and  lower  bounds  to  the  system  error 
probabilities 

A  situation  like  this  can  occur  in  several  applications.  For 
example,  the  exact  statistics  of  the  interference  may  not  be 
known,  and  only  estimates  of  the  first  moments  may  be  available. 
Another  example  arises  when  7.  represents  the  intersymbol  inter¬ 
ference  generated  by  a  channel  impulse  response  whose  samples 
are  known  only  in  a  certain  interval  This  may  occur  because 
those  samples  have  been  measured  with  finite  accuracy,  or  be¬ 


ll.  Moment  Bounds  with  Moments  in  a 
Parallelepiped 

In  this  section  we  shall  briefly  outline  the  theory  of  moment 
bounds  with  moments  in  a  parallelepiped,  and  we  shall  derive  a 
solution  to  the  problem  posed  in  Section  I. 

Consider  first  the  set  of  all  the  probability  distribution  func¬ 
tions  Fy(  )  of  the  random  variable  V  whose  range  is  the  finite 
interval  [a,  6]  and  whose  first  n  moments  are  r,,  •  ,c„  Denoting 
such  set  by  F(c),  where  c  is  the  n-vcctor  whose  components  arc 
Ci,-  •.<■„,  the  "classical"  moment  problem  can  be  formulated  as 
the  search  for  the  minimum  and  maximum  values  of  the  integral 
/S^(y)  dFr(y)  as  Fr  ranges  over  V(c).  More  specifically,  we 
seek 

/(c)-  mm  fa(v)JFy(y),  (2.1) 

F,  e  Ptr)  Ju 

and 

/(c)-  max  JhU(y)dFy(y).  (2  2) 

Fr*  PtOAi 
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Consider  ihcn  i*n  n- vectors  c  and  <•",  with  components  .  r„ 
and  cj'.  respectively  We  define  a  partial  ordering  in  the 

space  R"  of  n  sectors  bv  defimng  c'  ■<  c"  if  c'  *  c"  and 

(  D*  s  (  if  *ti.  k  -  1.  ,/i  (2  3) 

The  sc(  of  vectors  e  such  that  c  *  c  *  c"  is  then  the  paral¬ 
lelepiped  II 

(  I  1  s  (  •  1)"  *e,  (  -1)"  kc‘k’,  k  ~  1.  -,n 


1AUI  l  I 

Condi  i  ions  kir  rut  Vm  idi  i  y  or  I  mom  m  2 
Whi  n  thk  Noisl  IxOaiissian 


Y  -  ({/.) 

Y  -  h  +  / 

>  -  \h  *  /)' 


'(v) 


Condi  lions 

h  -  7)  ~ 

— ; - >  n  odd 

Jlo 


I)  >  U.  il  odd 


The  po.nis  e  and  c"  are  the  endpoints  of  the  “oblique  diagonal" 
i  f  ;  1  (see  I  ig  1  for  an  example)  We  have  the  following  result. 

theorem  I  [2.  p.  20X1.  If  c’  and  c"  are  moment  sequences  of 
random  variables  with  fume  range  [a,  />),  and  a  >  0,  then  any  c 
such  that  c‘  a  c  *:  c"  is  a  moment  sequence. 

Let  us  now  suppose  that  the  random  variable  Y  has  moment 
sequence  l>mg  in  the  parallelepiped  11  whose  endpoints  of  the 
oblique  diagonal  arc  the  /i-vectors  (c[.  c'2,  cj,  •  •  ■  )  and 
(([’.  ev.  ■  •  ■  )  If  we  denote  by  T([i)  the  set  of  all  the 
probability  density  functions  whose  range  is  (a,  ft)  and  whose 
moment  sequence  lies  in  the  11,  the  moment  problem  considered 
m  the  muoduction  can  be  formulated  as  the  search  for  the 
minimum  and  maximum  values  of  the  integral  j^Q(y)  dFY(y)  as 
Fy(  )  ranges  over  V{  11).  That  is 

/(ID-  mm  fhn(y)dFr(y),  (2.5) 

f  re  villi-',, 


V ( 1 1 )  —  max  f*V(y)  dFr(y).  (2  6) 

F,e  I'ltli-'u 

The  following  theorem  holds. 

Theorem  2  [2.  p.  2201:  If  FI  is  the  set  of  n-vectors  e  such  that 

c'  *  c  = i  c”,  then 

/(II)  -  /(c  )  (2.7) 

and 

/(ID  -/(«•")  (2  8) 


provided  that 


(  I J  '  " 
a" 


a  >  0 

0.  k-  1.2.- 
1 1  ’  ( /)  >  0,  for  a  <,  I  5  h. 


Ill  Application  to  rut  Gaussian  Cask 

In  this  section  we  shall  examine  doselv  the  implications  of  the 
technical  conditions  (2.9)— (2.1 1).  Since  their  validity  is  dependent 
on  the  choice  of  the  function  fl(-).  we  shall  consider  the  two 
special  cases  considered  in  Section  1,  namely,  (1.7)  and  (1,8). 
corresponding  to  ii(.t)  -  Q(  x/o)  and  Q(x)-Q(Jx/n>  For 
other  possible  choices  of  12(  •)  (see  jl|)  the  analysis  can  be  worked 
out  by  straightforward  extension  of  the  techniques  presented 
here. 

Consider  first  condition  (2.9),  which  requires  min  )'  >  0  This 
is  equivalent  to  the  requirement  that  h  -  D  >  0  (the  open-eye 
assumption)  when  (1.7)  holds.  If  (1.8)  was  chosen,  condition  (2.91 
is  always  satisfied,  but  the  eye  must  be  open  for  (1.8)  to  hold 
Thus,  h  -  D  >  0  is  a  necessary  condition  ui  both  cases. 

Consider  then  (2.10).  If  ft(x)  -  Q(x/o),  we  have  for  k  i  1: 

(-I)-" 


where  //„(•)  is  the  Hcrmite  polynomial  of  degree  n  [3.  p.  691). 
Hence,  for  (2.10)  to  be  satisfied,  the  following  must  hold: 


(27) 

and 

\  /2  o  ' 

h°- 

A  -  1.2. ■  ■  ■.», 

( 7.2) 

(2.8) 

(-  O'" 

■<K;) 

1  >  0. 

(3  3) 

As  u  >  (I  may  he  arbitrarily  small,  the  rule  for  determining  /(II ) 
and  /(II)  embodied  in  Theorem  2  can  be  extended  to  the  case 
a  -  0  j  2 .  p  220) 

before  proceeding  further,  let  us  comment  briefly  on  the  results 
summarized  by  the  theorem.  If  the  technical  conditions 
i2  9)  (2  11)  are  satisfied,  then  £, )fl(  T)).  and  hence  the  error 
piobabiliiy,  is  bounded  atxive  and  below  bv  two  “standard" 
moment  bounds  /(r'  )and  /(e  )  Evaluation  of  these  two  bounds 
van  be  pc  Tunned  bv  using  the  techniques  described  in  (1),  where 
the  moment  sequences  to  be  used  for  the  calculation  arc  obtained 
bx  (.iking  alternately  the  upper  and  lower  bounds  of  (1  !(>).  In 
fact,  by  comparison  of  (1  10)  jnd  (2  4)  it  is  seen  that  the  vectors 
r  and  <  to  be  used  m  (2  7)  (.’  ft)  are  given  by 


If  n  ts  even.  (3.3)  cannot  hold  true;  if  n  is  odd,  (3.3)  is  always 
satisfied,  and  a  sufficient  condition  for  (3.2)  to  hold  is  that 
a/  y/2 <7  be  larger  than  the  largest  root,  say  of  //t  ,(x), 
k  -  1,2,-  •  -,n.  Moreover,  since  the  value  of  ik. ,  increases  with 
increasing  k.  the  sufficient  condition  becomes 

?2a  *  “  (?  41 

where  r„_ ,  denotes  the  largest  root  of  //„  ,(.v).  and  u  -  h  -  D. 
If  on  the  contrary  V.(x)  -  Q{fx  /o),  from  |1)  we  have 

(-l)"*,“n,‘,(a)-  (-1)""  («>o  " 

A  =  I (35) 

where  g,,{  t)  is  a  nonzero  polynomial  with  nonnegatne  uieffi- 
cicnts  Fq.  (3.5)  shows  ihat.  since  a  >  0.  for  /;  odd  (2.10)  is  always 
satisfied 

Consider  finally  (2  II)  Tlie  computations  iust  performed  van 
be  used  to  show  llial  lor  U(  \  )  -  v/n)  (2  11)  holds  line  for  n 
odd  provided  llial 


'i  ii  cun  ail' I 


Since  /„  >  /,  this  i  onditmns  implies  (3  4)  as  well  Similarly, 
lor  S2 (  s  )  —  (/( v  \  n  :  and  n  ixld  it  can  be  seen  l ha l  (2  II)  alw  av  - 
holds  true 

The  conditions  for  the  vahdilv  of  rtieorem  2  are  summarized 
in  Tabic  f.  It  is  seen  (hai  in  both  eases  il  is  necessary  llial  n  K 
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Kg  2  Upper  and  lower  bounds  io  error  probability  for  a  binurv  comnuinj- 
tation  I>iMuibajut>  arc  uddtivc  Gau*s»jn  noise  atui  intcrsytnbol 

interference  generated  bv  .«  i>ju"iati  pulse  whose  samples  can  fluctuate  in 
*1 1 « ■  r\  3  j  s  of  i! 'at.ve  a  r  pi  ■•rut  abound  iheir  nominal  values. 


and  Im,}".,  fox  >n  (4  ')  will  provide  two  sets  of  mo¬ 
ments  {<■,}“_ |  and  that  satisfy  (1  10),  and  hence  allow 

one  to  use  the  theory  outlined  in  Section  11 

These  results  hase  Seen  applied  to  a  system  whose  nominal 
impulse  response  is  specified  In  the  Gaussian  pulse 
esp  -  (Ni/5/  )-' sampled  at  multiples  uf  !  ;  hen  A  *  1./.  is  taken 
as  h.  and 

6 

/>  -  Y  \h  ■  «  0  I  sjos 


We  assume  that  the  A  .  i  -  I.  can  ^actuate  around  their 
nominal  values  within  an  interval  of  width  Sh  We  define 


<  A 


Afi 

A, 


lOO'l 


and  the  signal-to-noise  ratio  (SNRi 


odd  However,  this  n  not  ,i  restriction  In  fact,  it  can  be  easily 
plowed  that  it  is  sufl.sieiit  to  v lunge  il(  )  into  ii(  )  (i  e  .  to 
icK'k  lor  tsoimjs  io  /  .  i  nias.c  the  thci'iem  hold  for  n  even  as 
wed 


SNR  -  20  log  h 

lfy  taking  n  -  10  moments,  and  choosing  V  -  A  +  Z,  we  obtain 
the  numerical  results  shown  in  Fig  2  for  the  error  probability  of 
the  digital  system. 


!\  Numerical  Results 

In  thus  section  wc  present  some  numerical  results  for  the  error 
bound  thcoiy  discussed  in  Sections  II  and  111  In  particular,  we 
shall  consider  a  binary  digital  system  perturbed  by  additive 
Gaussian  nooc  and  inlc'ss mbol  interference 

i 

/  -  L>  A.  (4>) 

/-I 

where  a,,  (  I.  ait  .ndependent  identically  distributed 

random  variables  taking  on  values  t  1  with  equal  probabilities 
We  assume  that  the  samples  h„  I  -  1,  ,/  .  of  the  channel 

unpul  .e  icsp.uise  are  m.wi  irately  known,  so  that  each  of  them 
lies  in  a  finite  interval  whose  boundaries  arc  known: 

h,  v  A,  <  <i,.  (4.2) 

Consider  the  moments  is'  /,  all  ol  its  odd-order  moments  are 
zero,  whereas  the  even  older  moments  ran  be  given  (fie  form 

.2rrt(  )/.;-•  -  ft;'"'.  (43) 

where  the  summation  is  es tended  over  all  the  possible  /.-tuples  of 
different  indices  m,.  . >n ,  such  tha’  2m,  t  +  2m/  —  2m. 

The  notation  (?m;  2m. ,  ,2m,  )  denotes  the  multinomial 

coefficient,  which  is  positive  and  we  have  taken  into  account  the 
fact  that  T[n;'"|  -  1  for  ail  m  1  q  (4  3)  shows  that  the  even-order 
moments  of  /  arc  polynomials  in  the  variates  A;.  .A;  with 

imiincgativc  coefficients,  which  implies  that  they  are  increasing 
functions  of  A;,  ,A;  This  shows  that,  by  substituting  for 

A;,  ,/?;  in  (4  3)  thru  tummum  and  maximum  values,  respec¬ 
tively,  we  get  two  scquimv.  of  moment'  { jt.  and 

such  that 

,.  ![/•]■.  1-  (4.4) 

C  onsider  now  the  random  variables  r  ~  A  •»  /  and  Ym  (h  *■ 
/  '  the  iiaiinents  .  I  >  i  an  t>c  <  xpic’sscd  til  holli  r  ases  llliougfl 

1 '  ,  tin  tin  r ;  I  ,  of  /  ,i  .  h  -l|.  . 

/  f  >■'!-/  (A  >  7)'" 

■  Y  (  r'l  V  .  (4  M 

w  here  ;i  -  1.2  Sim  i  1  1  i  e  expressible  as  a  linear  combination 
Of  moments  a ;  with  p<-  .m  fficicnls.  siiPsiiuiiion  id  ;  g  . 


V.  Conclusion 

The  theory  of  moment  bounds  was  extended  to  encompass  a 
situation  in  which  the  moments  of  the  random  variable  represent¬ 
ing  the  interference  are  inaccurately  known.  In  this  case,  the 
evaluation  of  upper  and  lower  bounds  to  the  error  probability  is 
reduced  to  the  computation  of  "standard"  moment  bounds,  in 
which  the  sequence  of  moments  is  constructed  on  the  basis  of  the 
upper  and  lower  bounds  to  the  moments  of  /.. 
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Appendix  C 
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OPTIMAL  LINEAR  RECEIVING  FILTER  FOR  DIGITAL 
TRANSMISSION  OVER  NONLINEAR  CHANNELS 


E.  Blgllerl,  M.  Ell*,  and  L.  LoPrestl 

Dlpartimento  dl  Elettronica 
Polltecnico  di  Torino,  ITALY 


A  I-  VTKACT 

Cr.!-  optimum  1  incur  receiving  filter  for 
digital  t  runanuaion  over  a  nonlinear  channel  1* 
specified.  undor  the  assumption  that  the  channel 
hud  a  finite  memory  and  that  the  noise  is 
additive  u.ioaniun,  an  optimum  receiver  ie  sought 
in  thu  form  01  a  linear  receiving  filter  followed 
ty  «  aymbol-retc  sampler  and  a  memoryless 
decision  device,  ".'he  receiving  filter  ia  choaen 
so  us  to  minimise  the  mean-square  error  between 


the 

input 

to  the  decision  devlcu 

and 

the 

t  ran:i«u  t  ted 

symbol . 

It 

10 

shown  thut  the  structure 

of 

this 

optimum  receiving  filter  correeponde  to  e  bank  of 
matched  filters,  euch  followed  by  a  linear 
transversal  filter.  The  number  of  matched  filters 
is  equal  to  the  number  of  linearly  independent 
waveforms  that  con  be  observed  at  the  channel 
output  in  a  eyrabol  period.  This  result  is  a 
generalisation  of  s  fact  which  la  wall-known  for 
1 ineurly  modulated  signals  transmitted  over 
linear  chunnels.  Alao,  the  performance  of  the 
receiver  ao  optimized  Is  evaluated  and  compared 
to  conventional  designs. 


d.-Arti-IbbT  Or  THU  PHJULKM  AND  MOTIVATION  OK  THh1 
■  OKA 

Let  !  >1 »  t  i  1  )  I  '  (  be  a  set  of  finite-energy 

waveforms  with  common  duration  T,  and  (5  )  .,  a 
wide-sense  stationery  sequence  of  PancPom 

variables  taking  value*  in  the  set  |l . M) . 

Lonsider  then  the  signals 

»(  t)  •  V~  -I't-nT;^)  (l.l) 

and 

y( t)  *  i !  t )  •  v  t ;  (1.2) 

vere  v1  >  j  a  w*  .to  noise  process  Independent 

the  a.--,  j.-nr,  |  ),  with  two-elded  power 
spectral  U”  nt  .  ,  '  The  problem  ye  consider 

m  iiie  ►(.(.«-!  i'i  tut  following!  for  e  given 
■  r'-e  r~  i  n  i  a-  .  !  a1,.)  of  the  random 

variable*  i  an  :  given  time  t  *  t  ♦  nT. 

r  no 


0<t  <T,  choose  the  impulse  response  u(t)  of  a 
linSar  time-invariant  system  such  that  the 
quantity 

/-K[ly(tn)*u(tn)-a(|n)!2]  (1-5) 
(wheru  *  denotes  convolution)  is  u  minimum. 

The  problem  just  stated  is  the  formulation, 
in  a  ruthur  goneral  form,  of  a  problem  uriaing  in 
data  transmission.  We  want  to  specify  an  optimum 
receiver  in  the  form  of  a  linear  receiving  filter 
followed  by  e  symbol-rate  sampler  and  a 
memoryleea  decision  device.  The  channel  ia 
assumed  to  be  time-invariant  (linear  or 
nonlinear),  snd  to  hova  a  finite  memory.  The  data 
symbols  may  be  encoded,  and  the  memory  introduced 
by  the  encoder  ie  assumed  to  be  included  in  the 
channel  part  of  the  data  transmission  model.  The 
Channel  consist*  of  a  noiseless  part  followed  by 
the  addition  of  noise.  This  noiseless  part  ia 
depicted  in  fig.'  •  A  sequence  («n)  °T 
independent  data  symbols  is  fed  into  a  3hlft 
register  with  L-1  delay  elements  (denoted  by  P  in 

the  figure).  The  L-tuple  l',r)>an  f . un 

denotes  the  atate  of  the  channel  at  the  discrete 
tin*  n.  We  assume  that  there  are  M  different  such 
L-tupl*s,  that  we  put  in  s  one-to-one 
correspondence  with  the  integers  1,...,M.  In  any 
time  interval  of  duration  T  there  ie  a  finite 
huab«r  of  possible  signals  at  the  output  of  the 
noiseleos  part  of  the  channel,  and  they  depend  on 
the  atate  of  the  channel.  We  denote  by  q(t-nT;{  ) 
the  waveform  at  the  channel  output  In  tRe 
intervel  nT<t<(n*l)T.  Thus,  before  the  addition 
of  noise  the  transmitted  signal  has  the  form 
(l.l)  and  the  signal  observed  at  the  receiver's 
front  end  has  the  form  (1.2).  For  a  receiver 
structure  as  depicted  in  Fig. 2,  we  want  to  choose 
u(t)  such  that  the  sample  y(t)  taken  at  time 
t  *nT  is  at  close  as  possible  to  the  transmitted 
symbol  an.  If  we  define  »({n)  *  »n.  the  quantity 

y(t  )*u(t  )-a(4  )  in  (l.j)  is  tho  difference 
n  n  n 

between  the  Input  to  the  decision  device  and  the 
corresponding  transmitted  symbol.  Thus 
minimisation  of  g  provides  s  minimum  uquare 
error  receiver. 


Optir.i ii  t  ion  of  the  receiver  under  this 
minimum  n«u;>  aqu  a  re  niTor  criterion  for  o  linear 
channel  end  pul se-am] I i \ ude  modulation  received 
much  attention  in  tne  literature  (see,  e.g., 
[l]'.  More  recently,  optimum  receiving  filters 
were  considered  for  nonlinear  chunnela  as  well 


That  work  was  motivated  by  digital  satellite 
communication,  in  which  amplifiers  are  used  that 
operate  at  or  near  saturation  for  batter 

efficiency.  Fredicson  . 2 j  considered  a  quadrature 
PSK  system,  and  used  an  approximate  model  for  the 
nonlinear  amplifier.  Meaiya,  F.cl^ne  and  Campbell 
3  ]  specified  an  optimum  receiving  filter  for 
binary  PSK. 

This  paper  is  an  extension  of  the  previous  work 
done  in  this  area.  Our  approach  Is  gsneral  enough 
to  encompass  a  number  of  multilevel  modulation 
schemes  and  any  kind  of  noiaslesa  nonlinear 
channels-  Consideration  of  coded  symbols  is  also 
possible,  as  well  us  the  addition  of  interfering 
signals  other  tnur,  white  noise.  However,  for  sake 
of  simplicity  we  shall  not  deal  with  these  more 
general  situations  here.  In  the  following,  we 
anall  provide  a  combined  analytical  and 
simulation  technique  to  specify  the  optimum 
receiver  filter.  From  a  theoretical  point  of 
view,  it  is  interesting  to  observe  that  this 
optimum  filter  con  be  thought  of  aa  composed  of  a 
tnr.k  of  filters,  matchod  to  a  aet  of  n  <  M 
waveforms  that  form  a  buaia  for  (q(tii)i^  ,  Vn 


cascade  with  n  1 nf i ni te- length  transversal 
filters.  This  la  a  generalization  of  a  result 
which  is  well-known  in _ the  case  of  PAM  modulation 

and  linear  cham.oij  1  I. 


THE  uhNERAL  SOLUTION 

To  formulate  in  the  frequency  domain  the 
problem  of  minimizing  <?  in  (1.J),  let  us  denote 
with  U(f)  the  transfer  function  of  the  receiving 
filter,  and  witn  X(f)  the  Fourier  transform  of 
the  signal  x(t)  defined  in  (t.l).  We  have 


X(f)  -  E  tt(f;{  )  oxp(-jCrrfnT) 

new 


w.nere  l0(f;i)|; 


aro  t»'/  touri,r  transforms  of 


viia  fwriir  transforms  01 
the  waveforms  lq(t;i)lM  .  Eq .  (1.5)  c4n  fee 
rewritten  as  follows;  1,1 

^"*°a*.J  f '  r(  f .  f 1  )  C*(  !’’  )exp[  j2tt(  f-f ’  ) t  ]df  d  f- 

( ,m  m  n 

VTf)  u(fl  df  •  N  /2  j  |u(f)|2df  (2.2) 

«•* 

where  M  denotes  real  part,  the  auperscript  • 
denotes  complex  conjugate  and  the  following 
iittfini  ♦ions  have  do  en  used  i 

r(f,f)  -  n'x(f )x*(f)]  (2<5) 

V(f!  -  exp'  -  jj.r  r  )  e;„(£  )  X*(  f)  ]  (p.4) 


0*  -  E[|u(5n)|2]  (2.5) 

U»ir\g  standard  variational  calculua 
technique^  it  can  be  shown  that  a  necessary  and 
sufficient  oonditlon  for  U(f)  to  minimize  i  is 
that  it  be  solution  of  the  following  integral 
equation : 
o» 

\  r'(f,f'  )  U(f  )  exp( -J2yr(  f-f  ’  )  t  )  df’  ♦ 

Zm  n 

*  No/2  U(f)  -  V( f )  (2.6) 

Thus  far,  we  have  not  yet  exploited  our 
knowledge  of  the  structure  of  X(f),  provided  by 
(2.1).  Recalling  that  the  aequence  (£  )  is 
stationary,  we  can  define  n 

R(f,f’sn-«)  -  E[Q(f;fn)  0*(f’s{^]  (2.7) 

ao  that  r(ftf)  can  be  written  in  the  form: 

••  *9 

r (f.f)  -LE»(f.f;"-)  exp(-J2;t(nf-mf ’  )T)- 

M  «•  «■•!»  nt-la 

•  H(f  ,f’  i  l)exp( - J2rtl fT)  ]  6(  f-  f '  -n/T)  (2.8) 

where  the  equality 

E  exP(*J2rro(  f-f  ’  )T)  -  i/T  Y~  <5(  f-f'  -n/T)  (2.9) 

has  been  used.  If  we  define  the  functions 

a* 

A(f,f)  -^R(f,f;i)  exp(-J2nTrv)  (2.10) 

and 


c„(f)  -  A( f , f-n/T)  (2.,,) 

r"(f,f)  can  be  rewritten  as 

r(f,f)  -  1/T  C  (f)  <5(  f-f*  -n/T)  (2.12) 

so  that  the  integral  equation  (2.6)  takea  the 
form: 

m 

i/TEGf(f)  U(f-k/T)  exp(-J2nvt  /T)  ♦ 

X-.e  0 

*  No/2  0(f)  -  V(f)  (2.13) 

The  theorem  that  follows  provides,  under  certain 
conditions,  s  closed-form  solution  of  (2.1)). 

Theorem.  If  tho  functions  V(f)  and  C  (f)  in 
12.13)  can  be  written  in  the  form!  n 


V(f)  -  Qf(f)  B(f) 


(2.14) 


Cn(f)-Q  (f)  JT(f)Q*(f-n/T)exp(-j2nhto/T)  (2.13) 

where  the  superscript  T  denotes  transpose,  a 
dagger  denote*  conjugate  transpose,  Q(f)  i9  n 
column  M-vector  whose  components  are  the  Fourier 
transforms  of  the  waveform*  q(t;i),  HUN,  p(f) 
is  a  column  N-v*ctor  of  frequency  functions 

mlt  t  oWfU?  P,rl°d  ,/Tl  “nd 

matrix  of  frequency  funotlone  periodic  with 
period  1/T,  then  (2.13)  admit,  the  solution 


,!  Til') 


(2.16) 


,ere  r(f',  U  a  col.mn  M-vector  of  frequency 
:  nc i i on  a  periotic  •  -  t  pc  r  i  od  1  /  .  • 


vof 

Tuba  11 1  - !  on  o! 


(2.16"'  into  (2.15) 


/*•  n  ^  n  Q^f-*/"-') 

.  ■.  •  <  /  "n  !•:•  -  ,’( f)B( f’ 


(2. 17) 


;.eri,  A  ler.otes  '.He  M.M  identity  matrix- 
'  o'.vulcntlv  ,  ;  r  .  '  V  '  c,n  be  rewritten  in  the  form 

;t  -  f)  -  v.  (2.Ui) 

v;  .'  >t,e  t -vector,  periodic  with  period 
'  .  ,  vi  e  f  i  ne  1  V  > 


•.if)  r(f)  -  !'.!■> 


(2.19) 


0*  . 

>5_w’(  f- k /T)  0T  (  f-k/T)  *Nq/?  J  (2.20) 

k«-  *> 

Equation  (2.19)  can  be  rewritten  in  explicit 
form  in  the  time  domain.  If  c  (t),  KUN,  denote 
the  Inverse  Fourier  transforms  of  the  components 
01  the  vector  C(f).  the  periodicity  of  C(f) 
implies  that  c  (t)  have  the  following  form: 

C  (t)  -  2  c>  i(t-jr)  (?-2») 

rente,  the  ttnu-domuin  version  of  (2.18)  la 


V  V  q'(t;l)*<S(t-jT)  - 

XM  )  'if*  * 

-  S  r  S  ei  ,  q*  i  t  -  J  T  i  i )  ]  -  0 


(2.22) 


Thus.  li  the  wuveforma  q(t;i)  are  linearly 
.-dependent,  !  2.22),  and  hence  (2.18),  can 

r-ell  If  I,nj  only  If  ell  the  0  are  equal  to 

tero.  This  entalla  C!  (  r )  -O ,  i .  e  .  ,  J  assuming  that 

'■■'{)  is  none  i  n^u  1  «  r  . 


r(D  -  >f  ir)  h(D 


(2.25) 


r.q  .  (2.25)  shows  the  t  T  (  f )  is  periodic  with 

f«nod  t/T,  a»  it  should  be.  ^ 

Assume  now  ih*t  the  waveforms  in  |q(t;i)|4  _ 

ore  not  linenrly  independent .  In  thie  situation, 
let  |  ,  n<*. ,  denote  a  set  of  linearly 

independent  Niala  functions  for  !q(t;i)|*^  .  If 

0  notes  the  set  of  Fourier 
transforms  of  these  basis  functions,  and  U^(f)  la 
the  K-vector  whose  components  are  these  Fourier 
■  raneforme,  an  n*M  rojtri*  1)  can  be  found  such 


v,  •  f  •  :  Ui 


whose  solution  ia  now  given  by 
D  C(f)  -  0 
o r  equivalently , 

r(r)  -  r*D  b(f) 


(2.26) 


(2.27) 


whsre  D*  ia  the  Kin  matrix  called  the 
Moore-Penrose  pseudoinverae  of  D.  The  theorem  is 
proved . 


Thl*  theorem  ehowa  that  the  optimum  receiving 
filter  In  our  situation  can  be  thought  of  as 
composed  of  a  bank  of  filters  with  transfer 
functions  Q**(  f ;  1 ) ,  each  cascaded  to  an 
Infinite-length  transversal  flltor.  The  filters 
Q*(f;i)  ore  matched  to  the  waveforms  observed  at 
the  output  of  the  memoryless  part  of  the  channel. 

When  the  waveforms  iqCtii)!*^  are  not  linearly 

independent,  we  can  substitute  the  bank  of  M 

filters  1  Q*(  f ;  l)  1  ,  with  n  filters,  matched  to 

the  basis  functions  [  U^( f )  ,  according  to  the 

equation 


U(f)  -  m  f)  U  T(f) 


(2.28). 


COMPUTATION  Of  THE  TRANSFER  FUNCTION  OF  THE 
Cf*TIHUH  FHTSH 

From  the  results  presented  in  the  previous 
section.  It  Is  seen  that  the  transfer  function 
U(f)  of  the  optimum  receiving  filter  has  the 
general  expression 

U(f)  -  Q*(f)  H*’(f)  D*DB(f)  (5.1) 

where  B(f)  is  obtained  through  (2.14),  and  H(f) 
ia  defined  by  (2.20)  In  conjuction  with  (2.15). 
This  is  valid,  however,  only  if  the  conditions 
(2.14)  and  (2.15)  hold.  In  this  section  we  shall 
show  that,  with  the  model  assumed  for  the  chunnel 
(.see  Fig.1)  the  hypotheses  of  the  theorem  hold 
true,  and  an  algorithm  will  be  provided  to 
compute  the  quantities  Involved  in  (5-1 ) • 

To  do  this,  we  observe  that  under  our 
sasumptiona  the  discrete  sequence  (£n)fl_  w  forms 
an  M-atate  fully  regular  homogeneous  Markov 
chain.  Let  P  denote  the  transition  probability 
matrix  of  this  chain,  i-e.. 


(p>i,  -  p JU„-  il. 


and  define 


L(f)  -  S  Pnexp(-.i.btniT) 


this  cane, 
J*  r.  ::  f  1  - 


(2.25) 


A  -  [n{  1  )  ,a(2), - .(  V)  1  1 


Then,  it  cm:  tc  shown  that  the  expression  for  the 
optimum  filter  i  a 

U(t>e*pi-J..Tt  1  D*D<?T(f)  A  (3.3) 

whore  •;(!’)  t»  .-'ivcr;  by  (2.20),  «^  (  f )  by 
(2.36)  -  aru  f)  is  the  K -vector  whose 

components  are  ..(fiO . C(:';M). 

As  an  example  of  application,  consider  binary 
PdK  transmission  over  u  nonlinear  chunnel 
consisting  o !'  a  fin-order  Rutterworth  filter  with 
3-dll  bandwidth  .Ou/T  cascaded  to  a  nonlinear 
amplifier  (a  TAT  exhibiting  both  AM/AH  and  AM/PM 
convereionlr  r  i .4  snows  the  transfer  funotiori  of 
the  optimum  filter  for  e  channel  with  the 
r:onl  inesri  ty  removed.  Thu  powor  spectra  of  the 
received  signal  are  also  shown  for  comparison. 
The  performance  of  the  optimum  filter  is  shown  in 
Pig. 3-  for  comparison's  sake,  the  mean-square 
error  resulting  from  a  2-pole  Buttarvorth 
receiving  filter  with  optimum  3-dB  bandwidth  is 
a  1  ao  shown . 
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Pig.1  -  Model  of  a  d isc rete-inpu t ,  continuoa-output 
f lnitto-memory  channel 


»l«) 


•  <U 


Pig. 7  -  Model  of  the  receiver 


Pig. 3  -  Structure  of  the  optimum  receiving  filter 


ft'r  funi  tion  of  the  optimum 

J 1  up,  !'  1  tu r 

apedtrhl  density  of  the 
.  e  s  .  p  n  s  1 

’-order  Huttrrworth  TX  filter 

:  ’  ui 


Fig.O  -  ?^-PGK;  fltn-urder  duttorworth  TX  filter 
with  2b,  1  *  1.0b  .  Memi-square  error 
with  optimum  filter  (continuous  line) 
mid  with  ?-pole  Butterworth  filter 
(dnBhed  1  ir,?'  . 


Appendix  D 


AENSION  AL  MODULATION  AND  CODING  FOR  DIGITAL  TRANSMISSION 


ana  M  Ella  l ") 


performance  In  digital  data 
■at  Ion  can  be  achieved,  without 
ease  neither  In  bandwidth  nor 
by  adding  a  certain  amount  of 
.ty  in  the  transmi ssion  system. 

I s  to  do  that  are  an  Increase  in 
tens ions 1 i t y  of  the  signal 
tation  used  for  transmission,  or 
Cion  of  multilevel  trellis 
boeck")  codes.  He  consider  here 
nac ion  of  uoth  techniques.  A 
of  multidimensional 

lations,  called  "generalized 
codes"  is  introduced,  anq  it  is 
ow  an  llngerboeck  code  can  be 
d  using  one  of  those  as  an 


UKODUCTION 

,ltal  communication  over  radio 
.3,  both  available  spectrum  and 
•tter  power  are  generally 
I-  Thus,  to  oope  with  the 
'creasing  demand  for  digital 
tcation  services,  more  efficient 
.sslon  techniques  are  called  for, 
-he  search  for  bandwidth  and 
sfficlent  modulation  systems  has 
a  very  active  research  area.  In 
it  has  beer  widely  accepted  that 
^d  factor  is  relevant  in  the 
f  between  bandwidth  and  power, 
13  .  the  complexity  of  the 
Ication  system.  In  other  words, 
add:  t  lot  .1'.  amount  of  complexity 
>wed  at  the  transmitting  and  the 
•n8  ends  >;  tne  system,  its 
lance  tr  e  improved  without 
ling  ne.’her  bandwidth  nor  power. 

possib.e  wjy  to  allocate  the 
>nul  ut,  1  ex  1  ty  Is  to  use 
Ing  a. p" ibets  1 n  higher 
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dimensions.  Conventional  quadrature 
amplitude  modulation  (QAM)  and 
phase-shift  keying  (P3K)  use  the 
in-phase  and  quadrature  components  of  a 
carrier  to  build  up  a  two-dimensional 
(20)  signal  apace.  Now,  by  choosing  an 
appropriate  alphabet,  the  performance 
can  be  improved  by  going  to 
higher-dimensional  signal  spaces. 

Although  our  main  concern  here  is 
the  discussion  of  specific  designs,  it 
appears  also  natural  to  Investigate 
what  performance  improvement  can 
theoretically  be  achieved  when  this 
dimensionality  is  increased.  Using 
sphere-packing  arguments,  it  was  shown 
in  [10]  that  for  a  signalling  alphabet 
whose  elements  have  the  same  energy  E, 
the  dimensionality  n,  the  number  of 
signals  M,  and  the  minimum  Euclidean 
distance  d^„  in  the  signal  space  are 
re.ated  by  the  bound 


M  <  2  /  Iq((n-lV2  ,  1/2) 

where  qsd^  ME,  and  I  (x,y>  is  the 
Incomplete  beta-function. 

Similar  results  can  be  obtained  by 
allowing  the  signals  to  have  more  than 
one  energy  level.  To  do  this,  we 
assume  that  the  energy  spacing  between 
adjaoent  levels  is  equal  to  the  minimum 
squared  distance  between  signals,  and 
apply  the  bound  just  given  to  each 
energy  level.  This  results  In  a  set  of 
nonlinear  equations  (with  as  many 
equations  as  energy  levels)  ,  which  can 
be  solved  numerically  and  whose  results 
are  shown  in  Fig.O  .  In  this  figure, 
the  abscissa  logt M/n  represents  the 
Information  rate  in  bits  per  dimension, 
and  the  ordinate  is  labeled  by  a 
parameter  related  to  the  error 
performance  of  the  signal  set  when  used 
un  an  additive  white  Gaussian  noise 
channel.  More  specifically,  the  error 
probability  turns  out  to  be  a  monotone 
decreasing  function  of  the  product  of 
-^ / N#  (the  ratio  between  the  average 
energy  pap  bit  and  the  noise  power 
spectral  density)  and  log.M,  where 
&•*>  sd»i«  /£■  Thus,  given  two  systems 
opersting  at  the  same  average  energy  E 
and  information  rate,  the  one  having  a 
larger  value  of  log,M  will  exhibit 
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for  h  ig:;  tv  S9  ■  •..  •■‘f  performance  chart 

of  F  i  .  O  v.  a  s  in's1,  proposed  In  ['!]). 

're  gain  involved  In 
i  <j  i m ensicrid’  U.y  is 
-,t  :.  s  seen  trial  as  the 
■ j le  ge is  larger, 

v,!-:  <»,••  of  energy  levels 

small  values  of  n  . 

K  .  *.  L  signal  spaces  can 

;>  p  >  i  r  e  :  tai.eoosi  /  using  two 

p  n  u  s  v  -  o  ••  t  ■] :  go o .  corners  with  two 
:  r. _  gc  n  a :  .  \  v  1  ed  electromagnetic 
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.  :  ;  jr  f a  i,,  v  division  multiplexed 
s  .  ,■.  r  a  1  s  .  ivi  ;s  s  on  specific  designs 
if  and  -.ode  sets  can  be  found 

1  ’  -  '■  }  .  They  show  for  example  that., 

•  •  :::-t if  are..!  t  f!  Jr'-',  the  signal  power 

•  at  :  an  e  ..  a  .  c  d  to  achieve  a  bit 
error  r.o'.e  cf  j's  is  about  1.?  dB 
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codes.  Generalised  group  codes  form  ar. 
exceedingly  large  class  of  alphabets, 
and  in  fact  most  of  the  alphabets  that 
nave  been  proposed  so  far  for 
multidimensional  signalling  belong  to 
this  family.  Techniques  are  shown  that 
allow  multid.mensicnal-alphabet 
vn.erboeck  codes  to  be  designed:  they 
are  based  cn  a  pa-tit-on  of  the 
alphabet  stemming  from  trie  partition  of 
a  group  into  cosets  of  a  convenient 
s  a  h  g  r  c  u  p  . 


2.0  GENERALIZED  CROUP  CODES  FOR  THE 
GAUSSIAN  CHANNEL 

Consider  a  set  of  K  n-vectors  1  x‘,}  ,  ... 

,  x^1}  ,  and  a  set  of  n  x  n  orthogonal 

matrices  S, ,  . . .  ,  SL  that  form  a 

finite  group  G  under  matrix 
multiplication.  The  set  of  vectors 
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■  ••ease  J  complexity 
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efficiency.  W  i  t  h 
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generalized  group  code" 
( G G C )  if  the  action  of  the  matrix  group 
G  on  the  "initial  set"  of  vectors  is 
such  that  each  of  them  is  transformed 
-nto  an  equal  number  of  distinct 
vectors.  The  special  case  K; 1  has  been 
studied  in  [ g ] ,  and  gives  rise  to  a 
class  of  alphabets  called  "group  codes 
for  the  Gaussian  channel".  Notice  that 
the  number  of  distinct  vectors  in  a  GGC 
may  be  ieSs  than  KL  (but  is  always  a 
multiple  of  <),  and  these  vectors  may 
not  span  the  Euclidean  n -dimensional 
s  p  a  c  e  . 


(0  (i) 

x;  '  °i  x) 


is  called  a 


If  I  x |  denotes  the  Euclidean 
length  of  a  vector  x,  the  quan  t  i  t  y  I  x  |*i  s 
proportional  to  the  energy  of  the 
signal  associated  with  x. 


As  orthogonal  matrices  transform  a 
v" ''or  into  another  having  the  same 
.•*nglh,  a  uGC  has  as  many  energy  levels 
a"=  there  are  in  the  "initial  set"  { * 

,  x»)  In  the  special  case  K- 1 , 
all  the  code  vectors  have  the  same 

.  d  1 1  gt.  h  . 


Let  now  Ry  denote  the  maximum 
likelihood  (ML)  region  associated  with 
b':e  code  vector  x'“,  i.e.,  the  3et  of 
Joints  in  the  Euqlidean  n-space  at 


■  ist  as  close  to 


“«V 


J 


t  3s  to  any  other 
•■Jof*  vector.  Then,  for  a  GGC  every  ML 
■■gion  is  congruent  to  one  among  r!"  , 
K  1 ’’  ether  words,  the’ ML 

i  r g  ions  can  have  r.o  more  than  K 
■  •  ‘  fiment  shapes  Consequen'iy,  if  a 
*  neceiver  Is  used  and  P(elx.1"'  denotes 
tne  error  prvbdh.; Sty  when  x/1  is 
-  -  an sm » t ted ,  this  cun  tawe  no  more  than 
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;ccn  that  a  sufficient  condition  is  C2  =  0  ,  or  Ct  =  0.  The  first  condition  is  equivalent 
-o  sta'e  that  for  a!!  the  encoder  states  the  symbols  available  to  the  encoder  have  zero 
nean.  For  equally  likely  source  symbols,  this  condition  reduces  to  the  requirement  that 
ill  the  columns  of  the  table  displaying  the  values  of  the  function  h(  ,  •)  have  a  zero 
irithmetic  mean.  The  second  condition  is  equivalent  to  state  that,  for  each  encoder 
tate,  the  average  of  the  symbols  forcing  the  encoder  to  that  state  have  zero  mean. 

Consider  then  the  line  spectrum  (2.19).  A  sufficient  condition  for  it  to  be  zero  is 
hat  fi  —  0,  i.e.,  the  average  of  the  symbols  at  the  encoder  output  be  zero.  From  (2.14) 
ve  see  that  if  C2  =  0  then  fi  —  0,  so  that  if  C2  —  0  encoding  does  not  alter  at  all  the 
>ower  spectrum  of  the  line  signal. 

Let  us  now  restrict  our  attention  to  the  fngerboeck  code  described  in 
.  1  .  it  ic  enemy  s*-en  that  ^f  the  source  symbols  are  eaually  likely  we 
e  for  this  code  =0.  More  generally,  the  condition  C  .=0  is  satisfied  by 
t  Ungerboeck  codes  designed  so  far,  at  least  when  the  source  symbols  are 
ally  likelv.  Th ,  s  is  due  t  c  the  highly  symmetrical  structure  exhibited 


ally.the  fourth  quantity  of  interest  is  the  mean-square  value  of  the  encoded 


L—i  N 

Co  =  £  £>fcp*IM*.S/)|5 

i— o 

/,-! 


(2.15) 


=  E 1>u‘ 

fc  — o 

he  asterisk  denotes  conjugate. 


are  dow  in  a  position  to  express  the  power  spectrum  of  the  modulated  signal 


13).  We  have 


$,(J)  =  Sic){f)  +  5id) 


(2.18) 


the  continuous  part,  is  given  by 


5<,c)(/)  =  IW)»2{^(Co-'<.!2'+:^(c;(i  -p“)B(/)c;)}  (2.17) 


)  is  the  Fourier  transform  of  the  waveform  q(t)  available  to  the 
duiator,#  denotes  real  part,  '  denotes  the  N  x  N  identity  matrix,  and 

B (/)  =  e~>2wfT  V"  (P  -  p oo^e-j2^fnT 


n  —  0 


=  \e}2wfTl  -  (F  -  P°°)  I 


1-1 


(2,18) 


i(J  for  non  details). 

ie  discrete  part  of  the  power  spectrum  (line  spectrum)  is  given  by 


5{f\n  = 


i<?(/)i: 


T‘ 


E  <(/-’>/r). 


(2.19) 


n«=-»  —00 


nclusionn 


'■  are  now  ;n  a  position  to  state  simple  sufficient  conditions  that  a  code  must 


for  its  sp*-.  ‘ruin  to  be  equal  to  a  scalar  multiple  of  I  Q[f)\' .  From  (2.18)  it  can  be 


*  *■ 
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exists  under  our  assumptions,  and  has  all  its  rows  equal  to  the  probability  vector  p. 

Let  us  now  evaluate  four  quantities  which  play  an  important  role  in  the  discussions 
that  follow.  The  first  among  them  is  the  average  value  of  Uk  taken  over  the  source 

symbols,  i  'a, 

L-  i 

Q>=5J^Ujt-  (2.10) 

k  —  U 

The  j  -th  component  of  C2  turns  out  to  be  the  average  of  the  symbols  available  to  the 
encoder  when  it  is  in  state  Sy. 

The  second  quantity  to  be  defined  is  the  N -vector  C]  whose  y~th  component  is 
the  average  of  the  coded  symbols  that,  when  output  by  the  encoder,  force  it  to  the  state 

S',  This  y~th  component  of  Cj  is  then  given  by  (see  [3J  for  details): 

L-l  N 

[ci  lf=EE  qkP(  fE*  ki  Hk>  si)  (2.11) 

Jt — 0  t~*  1 

(recall  that  (Ek  ] tj  =  1  only  if  the  source  symbol  k  takes  the  encoder  from  state  St  to 
state  Sj).  If  we  define  the  N  X  N  diagonal  matrix 
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U3  =  (w6  w1  w*  w5) 


(2.2  d) 


*  1  1 


)eaoting-  by  ,  k  —  0.  1 ,  .  .  ,  L  —  l,  the  probabilities  of  the  source  symbols,  i.c., 


qk  =  p{on  —  k},  Jc  —  0, 1, . . . ,  L  —  1, 


tate  sequence  (<rn)  is  a  homogeneous  Markov  chain  with  transition  probability 


P  —  ) 


!P]„  =  P{<7„,  -“Sj!  <r„  =$0  (2.6) 

[3}  for  a  proof ).  We  assume  that  the  chain  is  fully  regular  (see  f4]  for  the  relevance 
iis  assumption  ;n  the  computation  of  power  spectra  ).  The  matrix  P  provides  all 
informations  we  need  about  the  statistics  of  the  sequence  of  the  encoder  states.  In 
icular,  the  stationary  state  probabilities 


Pi  =  P{on  —  Si  },  i  =  1,2, ,  N 


,he  elements  of  the  row  /V  vector  p  obtained  as  the  solution  of  the  equation 


p  =  p  r 


ect  to  the  rondition  p,  =  1.  The  limiting  transition  probability  matrix 


P°°  =  lirn  Pn 


For  our  computations,  the  following  quantities  are  to  be  defined: 


The  state  transition  matrices  E* ,  k  —  0,  1 , . . .  ,  L—  1,  which  are  the  TV  X  TV  matrices 
whose  entry  [E*;],,,  is  equal  to  !  if  g(k,S,)  --  .S’,,  and  zero  elsewhere. 

The  row  vectors  U*,  k  --  d,  l,  ,L  —  I,  whose  N  entries  are  the  coded  symbols 
according  to  the  rule  [U*],-  —  ht.k,  S,). 


In  words,  the  matrix  E*  has  a  l  m  row  i  and  column  j  if  the  source  symbol  k  forces 
a  transition  of  the  encoder  from  state  5,  to  state  Sy,  and  a  zero  elsewhere.  The  vector 
Ujt  includes  the  coded  symbols  corresponding  to  the  source  symbol  k,  for  different  states 
of  the  encoder.  Fot  the  example  of  Fig.l,  we  have 


and 


r i ooo- 

00  10 

Eo  =  Ei  — 

10  0  0 

^0010- 
rO  10  0* 

!  0  0  0  1 
Eo  “  E3  =  : 

;  0  1  0  0 

i 

^0  0  0  1  - 


(2.1a) 


(2.16) 


U0  ~~  ( yj°  v)'  vr  w 3)  (2.2a) 

Hi  ■—  ( v/‘  in5  wn  w1)  (2.26) 

U'j  —  [v>7  v/  w°  ud)  (2.2c) 


< 


the  modulator,  the  modulated  signal  can  be  written  in  the  form 


CO 

2(0=  u"9(t"n7’)  (L3) 

n—  ~  oo 

We  want  to  compare  the  power  spectrum  Qx{f)  the  signal  x(t)  with  the  power 

jectrum  £.(/)  of  the  signal  x{t)  which  would  result  if  the  source  symbols  were  not 

linear  . 

ncoded,  but  sent  directly  to  a  WU I  modulator.  Simple  sufiicient  conditions  will  be 

,und  showing  that  a  properly  designed  Ungerboeck  code  does  not  alter  the  power 
pectrum  of  the  line  signal.  This  provides  a  formal  proof  of  the  often-claimed  fact  that 
Ingerboeck  codes  "do  not  expand  the  bandwidth  . 

!.  The  power  spectrum  of  the  coded  signal 

Ln  this  section  we  shall  present  the  power  spectrum  of  the  signal  x{t)  defined  in 
1.3).  As  the  computation  can  be  done  by  using  results  obtained  by  Cariolaro  ct  ai. 
3],  we  shall  not  repeat  their  derivation  here,  and  focus  instead  on  the  meaning  of  the 
quantities  involved. 

To  describe  the  encoder  operation,  we  need  to  specify  the  functions  h(-t  •)  and  g{-,  •) 
af  (1.1)  and  (1.2).  This  will  be  given  in  tabular  form,  by  providing  two  tables  whose  rows 
are  labeled  according  to  the  values  taken  on  by  the  source  symbols  a„(say,  0, 1,  1) 

and  whose  columns  are  labeled  according  to  the  values  takcD  on  by  the  states  o n  (say, 
Si,  S2t.  ..,SN)-  In  the  first  table  we  display  the  values  of  h[an,on),  and  tn  the  second 
the  values  of  g(anyon).  As  an  example,  Fig.l  provides  the  two  tables  needed  to  describe 
the  4-state  Ungerboeck  code  of  [l,Fig.7j. 


# 
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1.  Statement  of  the  problem  and  motivation  of  the  work 


In  this  paper  wo  consider  the  power  spectrum  of  the  digital  signal  resulting  after 
channel  coding  and  multilovel/phase  modulation.  The  family  of  codes  considered  are 
those  introduced  by  Ungerboeck  fl].  Our  aim  is  to  find  simple  sufficient  conditions  for 
the  resulting  spectrum  to  be  equal  to  the  spectrum  of  an  uncoded  signal. 

The  encoder  is  modeled  as  in  [2 j .  A  finite-state  sequential  machine  with  N  states 
is  driven  by  a  stationary  sequence  (a^^Lm-oo  ° f  independent  source  symbols  taking  on 
values  0,1 L  —  1,  and  emitted  every  T  seconds.  If  (<7r»)^L-oo  denotes  the  sequence 
of  states  of  the  sequential  machine,  and  («n)^L_oo  the  sequence  of  coded  symbols,  the 
behavior  of  the  encoder  is  described  by  the  equations 

un  =  h(a„,an)  (1.1) 

and 

Vn+l  =  9  Wn,°n)  (1.2) 

Eq.  (1.1)  describes  how  the  encoded  symbol  un  depends  on  the  source  symbol  o„ 

and  on  the  actual  state  an  of  the  encoder.  Eq.  (1.2)  tells  in  which  state  the  sequential 

machine  is  forced  to  move  at  time  instant  (n  +  \)T  when  it  was  in  state  an  at  time  nT 

and  the  source  symbol  un  is  fed  to  the  encoder. 

n  linear 

The  coded  seuuence  ( un )  is  then  sent  into  Ml  modulator.  This  can  be  seen  as  a 
mapping  of  the  symbols  un  into  waveforms  which  are  output  sequentially.  If  the  coded 
symbols  take  on  values  in  the  set  {0, 1, . . . ,  M  —  1}  and  q(t)  is  the  waveform  available 
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UNGERBOECK  CODES  DO  NOT  SHAPE  THE  SIGNAL  POWER  SPECTRUM 
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Appendix  E 


code  with  two  energy  levels,  obtained 
from  the  initial  set  of  4  vectors: 


c  c  c  0 

-  b  c  0  c 

c  -b  0  c 

c  0  -  b  c 


with  c  -  0  •  3  8  9  and  b-.0-939-  Applying  to 
tr,e  initial  set  the  same  group  of 
matrices  c the  previous  example,  we 
get  a  1 2&-syn.bc  1  alphabet .  32  symbols 

nave  energy  }.c2  .  and  96  symbols  have 
energy  b*  *  — 2  .  Tne  average  energy  is 


'  9  6(  p7  .  .  .  *  )  *  ?2( ?c3  ) ] / 1 28  s  1  • 


This  al; habet  cun  be  used  to 
design  an  Ur.gc-rboeck  code  with  rate 
6/?.  Fig  9  shown  the  8-state  trellis 
employe.!.  A  f  a  partition  of  the 
alphabet  into  16  subsets  of  8  vectors 
each  stems  from  the  subgroup  l  T  ,  —  I }  , 
where  I  is  the  nxn  identity  matrix. 
For  example,  the  first  coset  includes 
the  initial  set  and  its  negative.  We 
go  t 


;  =  o .  6  o  4 
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which  is  1 . ' t  cl  greater  than  the 
minimum  distance  achieved  by  using  two 
independent  8-PSK  signals  with  the  same 
energy  . 
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K  different  values,  and  the  average 
error  probability  is  given  by 

p,e)  s  1-  jr  P(  e  I  x-”) 

K  J  •  t 

Fig.1  shows  a  familiar  example  of 
a  signal  alphabet  (the  conventional 
' 6-QAM)  being  a  CGC.  It  has  parameters 
M -  1  &  ana  r.  =  2.  Points  1,2, 3. 4  denote 

tne  feu r  vectors  in  the  initial  -set. 
".he  matrices  generating  the  code  are 
■  Oic  associated  to  plane  rotations  by 
multiples  of  90°  There  are  three 

different  energy  levels,  and  three 
different  shapes  of  MU  regions  (their 
boundaries  are  snown  in  F  i  g  ■  1  )  - 
consider  no-1  the  3et  of  M  vectors  in  a 
GCC,  and  a  partition  of  it  into  m 
disjoint  subsets  X,  ,  ...  ,  .  For 

each  subset  Kf  ,  we  can  define  the 
" i ntrad 1  stance  set"  as  the  set  of  all 
tne  distances  among  pairs  of  vectors  in 
Xj,  We  say  that  a  partition  of  a  GGC 

is  "fai'-”  if  all  the  subsets  X(  include 
tne  same  number  of  vectors,  and  their 
'.".trail  stance  sets  are  equal. 

A  wav  to  generate  a  fair  partition 
: T  a  GGC  is  the  fallowing.  Partition 
tn»  matrix  group  G  into  the  cosets  of  a 
subgroup  H,  and  apply  each  cosot  to  the 
initial  set  cf  vectors.  It  can  be 
shown  that  the  resulting  partition  of 
the  GCC  is  fair.  For  example,  by 
partitioning  the  rotation  group  used 
; or  the  CGC  of  Fig.l  into  the  two 
..sets  associated  to  the  rotations  0 *  , 
■.*  and  9i  ,  -90',  respectively,  the  code 
is  fni'-'.y  partitioned  into  the  two 
s  _-ocr.ee  s  ('.,2,3,4,9,10,11,12)  and 
‘i ,  fc  ,  V  ,  3  ,  i  'i  ,  1 « ,  1  ‘j  ,  1 6 )  . 


i.Z  .MULTIDIMENSIONAL  UNGERBOECK  CODES 

We  srali  now  see  how  an  Ungerboeck  code 
■■•a:'.  be  designed  using  a 

••Itidimensional  alphabet  generated  as 
-  Section  2.  For  simplicity,  we  shall 
restrict  our  attention  to  codes  with 
rate  m/(m»1).  m-log,M.  Such  codes  can 
be  specified  by  giving  an  N-state 
trellis,  with  m  branches  stemming  from 
every  node  and  M  branches  reaching 
every  sole.  The  branches  are 

associate!  to  the  elements  of  an 
alphabet  with.  ?M  elements  (see  [6]  for 
mere  details)  .  Although  no  formal 
proof  has  been  found  yet,  for  a  good 
coda  it  is  conjectured  that  the 
assignement  cf  symbols  to  the  branches 
mould  exhibit  a  regular  and 
symmetrical  structure.  This  in  turn 
can  be  obtained  by  assigning  to  each 
node  and  tu  tr.e  u'arches  connecting  any 


pair  of  nodes  a  subset  of  symbols 
obtained  from  a  fair  partition  of  the 
alphabet.  In  fact,  all  the  designs  of 
Ungerboeck  codes  known  to  the  authors 
exhiDit  this  kind  of  symmetry,  although 
no  systematic  procedure  to  achieve  it 
has  been  proposed  before 


4 . 0  SOME  EXAMPLES 

We  shall  now  describe  in  some  details 
two  examples  of  designs  of 
four-d imens  Ion al  Ungerboeck  codes.  No 
attempt  at  optimizing  the  designs  has 
been  undertaken  yet,  so  these  examples 
3hould  be  taken  as  an  illustration  of 
the  concepts  presented  before  3nd  not 
as  a  list  of  good  4D  codes. 

The  first  example  originates  from 
a  group  code,  obtained  by  permuting  the 
components,  and  replacing  them  with 
their  negatives,  in  the  initial  vector 
(a,a,a,0),  a=1  /yfT.  This  alphabet 
includes  32  vectors.  The  2-state 
trellis  of  Fig. 2  can  be  used  to  design 
an  Ungerboeck  code  with  rate  4/9,  and 
hence  transmit  4  bits  per  alphabet 
s  ymbol  , 

Fig. 3  shows  a  fair  partition  of 
t  lie  alphabet  in  four  subsets  of  8 
vectors  each.  This  partition  is 
obtained  as  follows:  denote  by  c i  the 
orthogonal  matrix  whose  effect  on  a 
vector  is  to  cyclically  shift  its 
components  to  the  right  by  one 
position,  and  to  change  sign  to  the 
second  component.  Then  the  set 


H=  (a*,  a',  aJ,  a5,  a‘,  a5,  a‘,  a7) 


is  a  cyclic  subgroup  of  the  group  G 
generating  the  alphabet,  and  its  cosets 
generate  the  fair  partition  of  Fig. 3. 
By  associating  th»  first  node  of  the 
trellis  the  subcodes  A  and  B,  and  to 
the  second  node  the  subcodes  C  and  D, 
the  free  distance  of  this  code  is 


If  this  figure  is  compared  to  the 
minimum  distance  achieved  by 
transmitting  the  same  amount  of 
information  over  the  same  number  of 
dimensions  using  two  inde'-ndent  4-PSK 
signals,  we  see  that  an  nergy  saving 
of  1.24  dB  is  obtained. 


Consider  now  a  generalized  group 


n  s,  I  S2  S 


O  I  4  i  3  ■* 

W  !  W1  Wa 


1  |  W6  W' 


via  e 


2  j  vo1  w3  w°  vJ1 


3  w6  vi7  w*  *!* 
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